(12) INTERNATIONAL APPLICATION PUBLISHED UNDER THE PATENT COOPERATION TREATY (PCT)

(19) World Intellectual Property Organization
International Bureau

(43) International Publication Date
10 October 2002 (10.10.2002)

(10) International Publication Number
WO 02/079415 A2

(51) International Patent Classification: C12N

(21) International Application Number: PCT/US02/09650

(22) International Filing Date: 29 March 2002 (29.03.2002)

(25) Filing Language: English

(26) Publication Language: English

(30) Priority Data:
60/280,625 30 March 2001 (30.03.2001) US

(71) Applicant: LEXIGEN PHARMACEUTICALS CORP.
[US/US]; 125 Hartwell Avenue, Lexington, MA 02421-3125 (US).

(72) Inventor: GILLIES, Stephen, D.; 159 Sunset Road,
Carlisle, MA 01741 (US).

(74) Agent: WALLER, Patrick, R., H.; Testa, Hurwitz &
Thibeault, LLP, High Street Tower, 125 High Street,
Boston, MA 02110 (US).



(81) Designated States (national): AE, AG, AL, AM, AT, AU,
AZ, BA, BB, BG, BR, BY, BZ, CA, CH, CN, CO, CR, CU,
CZ, DE, DK, DM, DZ, EC, EE, ES, FI, GB, GD, GE, GH,
GM, HR, HU, ID, IL, IN, IS, JP, KE, KG, KP, KR, KZ, LC,
LK, LR, LS, LT, LU, LV, MA, MD, MG, MK, MN, MW,
MX, MZ, NO, NZ, OM, PH, PL, PT, RO, RU, SD, SE, SG,
SI, SK, SL, TJ, TM, TN, TR, TT, TZ, UA, UG, UZ, VN,
YU, ZA, ZM, ZW.

(84) Designated States (regional): ARIPO patent (GH, GM,
KE, LS, MW, MZ, SD, SL, SZ, TZ, UG, ZM, ZW),
Eurasian patent (AM, AZ, BY, KG, KZ, MD, RU, TJ, TM),
European patent (AT, BE, CH, CY, DE, DK, ES, FI, FR,
GB, GR, IE, IT, LU, MC, NL, PT, SE, TR), OAPI patent
(BF, BJ, CF, CG, CI, CM, GA, GN, GQ, GW, ML, MR,
NE, SN, TD, TG).

Published:

- without international search report and to be republished
upon receipt of that report

For two-letter codes and other abbreviations, refer to the "Guidance
Notes on Codes and Abbreviations" appearing at the beginning
of each regular issue of the PCT Gazette.



(54) Title: REDUCING THE IMMUNOGENICITY OF FUSION PROTEINS

(57) Abstract: Disclosed are compositions and methods for producing fusion proteins with reduced immunogenicity. Fusion
proteins of the invention include a junction region having an amino acid change that reduces the ability of a junctional epitope to bind
to MHC Class II, thereby reducing its interaction with a T-cell receptor. Methods of the invention involve analyzing, changing, or
modifying one or more amino acids in the junction region of a fusion protein in order to identify a T-cell epitope and reduce its
ability to interact with a T-cell receptor. Compositions and methods of the invention are useful in therapy.

REDUCING THE IMMUNOGENICITY OF FUSION PROTEINS

Related Applications

[0001] This application claims priority to and the benefit of U.S. provisional patent
application serial number 60/280,625, filed March 30, 2001, the entire disclosure of
which is incorporated herein by reference.

Field of the Invention
[0002] The present invention relates generally to methods and compositions for making
and using modified fusion proteins with reduced or no immunogenicity as therapeutic
agents. More specifically, the invention relates to fusion proteins, made less
immunogenic by identifying candidate T-cell epitopes and modifying the amino acid
sequence to eliminate such epitopes.

Background of the Invention
[0003] Many therapeutic proteins are normal human proteins. For example, interleukin-2,
erythropoietin, and growth hormone are all human proteins that are given to
humans who already usually make endogenous levels of these proteins. In general,
immune responses against completely normal human proteins are rare when these
proteins are used as therapeutics.

[0004] Recently it has become apparent that many fusion proteins with artificial activities
are useful as therapeutic proteins. For example, Enbrel is a fusion of the extracellular
domain of a TNF receptor with an IgG1 Fc region. Enbrel is used to treat rheumatoid
arthritis, and is thought to function by titrating TNF and preventing TNF action.
However, a significant incidence of anti-Enbrel antibodies has been noted in patients
treated with Enbrel.

[0005] Another example of a therapeutically useful class of fusion proteins is the
immunocytokines. These proteins include an antibody moiety and a cytokine moiety,
and are useful for targeting cytokines to diseased cells, such as cancer cells.
However, the therapeutic use of many of these fusion proteins is curtailed due to their
immunogenicity in mammals, especially humans.

[0006] Therefore, there is a need to generate fusion proteins with reduced
immunogenicity in order to use these proteins in therapy.

Summary of the Invention 
[0007] The present invention features methods and compositions useful for producing
fusion proteins with reduced immunogenicity for use in therapy. For example, the
invention features immunocytokines, immunofusins, immunoligands, other antibody
and Fc fusion proteins, cytokine-cytokine fusion proteins, and albumin fusion proteins
with decreased immunogenicity.

[0008] The invention relates in part to the insight that fusion proteins contain sequences
that are "non-self." For example, even in a fusion between two human proteins, the
region surrounding the fusion junction comprises a peptide sequence that is not
normally present in the human body. For example, a protein drug such as Enbrel is
derived from two normal human proteins: TNF receptor and IgG1. However, the
junction between TNF receptor and IgG1 is a peptide sequence that is not normally
found in the human body.

[0009] Preferred methods of the invention involve reducing the immunogenicity of a
fusion protein by reducing the ability of a junctional epitope (junctional peptide) to
interact with a T-cell receptor by reducing its ability to bind (its binding affinity) to
MHC molecules. According to the invention, the junctional epitope or peptide is
preferably "non-self." In general, proteins, including therapeutic proteins, are
immunogenic, in part because proteins are endocytosed by antigen-presenting cells
and proteolyzed, and the resulting peptides bind to molecules called major
histocompatibility complex (MHC) that present the peptides to T cells. The antigenic
peptide - MHC complex on the surface of an antigen presenting cell (APC) activates
T-cells to proliferate, differentiate and release cytokines. In parallel, B-cell
differentiation and antibody production is induced, which may further limit the
therapeutic protein's effectiveness due to clearance. Thus, the antigenic peptide, if
derived from a therapeutic protein, is capable of inducing a series of undesired
immune responses. The therapeutic protein's effectiveness is limited due to titration
by antibodies, and the induction of T-cell and B-cell responses is often deleterious
due to inflammatory and allergic reactions in the patient.

[0010] The invention provides (1) the identification of novel amino acid sequences in
the region of the immunoglobulin - target protein junction with one or more candidate
T-cell epitopes; and (2) the modification of these amino acid sequences to reduce or
eliminate the presence of peptides, derived from the junction sequence, that function
as T-cell epitopes.

[0011] The invention provides two general classes of compositions and methods relating
to the reduction of immunogenicity. According to one embodiment of the invention,
potential non-self T-cell epitopes are identified in sequences that span a fusion
junction. For example, potential non-self T-cell epitopes are identified by
computational methods based on modeling peptide binding to MHC Class II
molecules. Substitutions are then made such that the ability of peptides deriving from
the junction region to bind to MHC Class II is reduced or eliminated. This process of
identifying and modifying peptides which bind to MHC Class II is termed
"de-immunization" and the resultant modified protein molecules are termed
"de-immunized."

[0012] According to another embodiment of the invention, one or more glycosylation
sites are introduced at a fusion junction. An N-linked glycosylation site is preferably
used, although an O-linked glycosylation site may also be used. According to a
preferred embodiment, amino acids in a junction region surrounding a fusion junction
of wild-type sequence are mutated such that the last amino acid of the N-terminal
fusion partner is mutated to an asparagine, and the first two amino acids of the second
fusion partner are mutated to a glycine followed by a serine or a threonine.

[0013] According to the invention, removal of MHC Class II binding is preferred in
situations where a protein is to be produced in bacteria or in an organism that does not
generate a mammalian glycosylation pattern, such as yeast or insect cells.

[0014] The introduction of glycosylation sites may be preferred when the protein is to be
produced in a mammalian cell line or in a cell line that creates a glycosylation pattern
that is innocuous to mammals.

[0015] In a preferred embodiment, a component of the fusion protein is a cytokine. The
term "cytokine" is used herein to describe naturally occurring or recombinant
proteins, analogs thereof, and fragments thereof that elicit a specific response in a cell
that has a receptor for that cytokine. Preferably, cytokines are proteins that may be
produced and excreted by a cell. Preferably, cytokines include interleukins such as
interleukin-2 (IL-2), IL-3, IL-4, IL-5, IL-6, IL-7, IL-10, IL-12, IL-13, IL-14, IL-15,
IL-16 and IL-18, hematopoietic factors such as granulocyte-macrophage colony
stimulating factor (GM-CSF), G-CSF and erythropoietin, tumor necrosis factors
(TNF) such as TNFα, lymphokines such as lymphotoxin, regulators of metabolic
processes such as leptin, and interferons such as interferon α, interferon β and
interferon γ, and chemokines.
Preferably, the antibody-cytokine fusion protein of the present invention displays a
cytokine specific biological activity.

[0016] In another preferred embodiment, a component of the fusion protein is an
anti-obesity cytokine. For example, a component is leptin, CNTF, or a portion of Acrp30.

[0017] In an alternative preferred embodiment, a component of the fusion protein is a
hormone. For example, a component may be insulin, growth hormone, or
glucagon-like peptide 1 (GLP-1).

[0018] In yet another alternative embodiment, a component of the fusion protein is a
ligand-binding protein with biological activity. In a preferred embodiment, an
extracellular domain of TNF receptor is used.

[0019] According to one series of embodiments, a fusion protein of the invention
comprises the N-terminus of a non-antibody moiety fused to the C-terminus of an
antibody moiety. According to another series of embodiments, a fusion protein of the
invention comprises the C-terminus of a non-antibody moiety fused to the N-terminus
of an antibody moiety. According to the invention, an antibody moiety can be an
intact immunoglobulin or a portion of an intact immunoglobulin. A portion of an
immunoglobulin can include a variable region or a constant region or both. Preferred
immunoglobulins include Fc regions or portions thereof. A preferred embodiment of
the invention includes an IgG1 immunoglobulin isotype, or a portion thereof,
modified to be less immunogenic and/or to have a longer serum half-life. For
example, an IgG1 with modification of amino acid residues near the CH3 - cytokine
junction is preferred. For certain applications, antibody moieties from IgG2 or IgG4
isotypes are preferred.

[0020] Immunocytokines are only one example of a tumor-targeted fusion protein
therapy. Other tumor-toxic molecules can also be targeted to tumors by fusion to
tumor-specific antibodies. In addition, antibody fusion proteins can attack other types
of diseased cells, such as virus-infected cells. Another approach to engineering
targeted fusion proteins has been use of Fc-X and X-Fc technology where X is a
polypeptide. These technologies utilize the knowledge that production and collection
of a target protein is improved if the polypeptide of interest is linked to the Fc portion
of an immunoglobulin. For Fc-X fusion proteins, a signal peptide, followed by the Fc
fragment of an immunoglobulin gene, is the N-terminal fusion partner to the target
protein. In some instances it is specifically advantageous to engineer a fusion protein
in the X-Fc orientation. With these constructs the target protein is the N-terminal
fusion protein and the Fc fragment follows. For some proteins this approach is useful,
as has been shown with lymphocyte cell surface glycoprotein (LHR) (US patent
5,428,130), and glucagon-like peptide (GLP-1).

[0021] Accordingly, methods and compositions of the invention provide forms of Fc-X
and X-Fc fusion proteins with reduced immunogenicity. According to the invention,
the immunogenicity of a fusion protein can be assayed according to a method known
in the art or disclosed herein.
[0022] Methods and compositions of the invention also provide albumin fusion proteins
with reduced immunogenicity. Human serum albumin (HSA), due to its remarkably
long half-life, its wide in vivo distribution and its lack of enzymatic or immunological
functions, has been used as a carrier for therapeutic peptides/proteins (Yeh et al,
PNAS 89:1904-1908, 1992). A genetic fusion of a bioactive peptide to HSA is useful
for recovery of a secreted therapeutic HSA derivative. However, according to the
invention, albumin fusion proteins such as HSA-CD4 have a novel junction which
generally contains one or more T-cell epitopes capable of being presented on MHC
class II molecules. The invention provides less immunogenic forms of albumin
fusion proteins, and general methods for reducing the immunogenicity of albumin
fusion proteins. According to the invention, useful albumin proteins include species,
allelic, and mutant variants of albumin, including fragments thereof. Preferred
albumin proteins retain the structural and functional properties of a wild-type albumin
protein such as HSA.

[0023] In another aspect, the invention provides de-immunized antibody fusion proteins
with normal, mutant, or hybrid isotypes that comprise useful mutations. These
mutations may be near the junction or at positions distinct from the region of the
junction.

[0024] For example, the invention provides a de-immunized immunocytokine, modified
at the junction, with a point mutation at the junction between the IgG and non-IgG
moieties. The cytokine moiety includes any cytokine but preferably IL-2. In
one embodiment, the amino acid changes involve changing the C-terminal lysine of
the antibody moiety to a hydrophobic amino acid such as alanine or leucine. A key
advantage of combining such mutations with a de-immunizing modification of the
invention is that the mutations act together to increase serum half-life and to decrease
immunogenicity. The methods described herein for combining de-immunization of a
fusion junction with a serum-half-life altering mutation are useful to improve
significantly the clinical efficacy of these fusion proteins.
[0025] In another aspect, the invention provides immunocytokines comprising a hybrid
antibody moiety that includes domains from different Ig isotypes, preferably from
both IgG1 and IgG2 isotypes, and a de-immunizing modification at the fusion
junction. For example, the invention provides a de-immunized, junction-modified
immunocytokine using an IgG2 and an IgG2h hybrid (IgG2 modified in the hinge
region to IgG1). In a preferred embodiment, the hybrid fusion protein consists of a
de-immunized immunoglobulin moiety composed of an IgG (γ1:CH1-H)(γ2:CH2-CH3)
and a cytokine moiety.

[0026] In another aspect, the invention provides novel nucleic acid sequences that
encode fusion proteins with reduced immunogenicity or facilitate the expression,
production, and secretion of fusion proteins with reduced immunogenicity. Such
nucleic acids are generated according to standard recombinant DNA techniques.

[0027] In a preferred embodiment, a nucleic acid molecule encodes an immunocytokine
fusion protein. A preferred immunocytokine includes Interleukin 2 and a tumor
specific monoclonal antibody such as an antibody to human
epithelial cell adhesion molecule KSA (EP-CAM) (huKS).

[0028] In another preferred embodiment, nucleic acid molecules encode Fc fusion
proteins in various configurations. The nucleic acid molecule encodes serially in a 5'
to 3' direction, (i) a signal sequence, an immunoglobulin Fc region and a target
protein sequence, or (ii) a signal sequence, a target protein, and an immunoglobulin Fc
region, or (iii) a signal sequence, a first target protein, an immunoglobulin Fc region,
and a second target protein. The resulting nucleic acid molecule thereby encodes an
Fc-X, X-Fc, or X-Fc-Y structure where X and Y are a target protein. In an alternative
embodiment, a nucleic acid encodes an Fc-X, X-Fc, or X-Fc-Y protein without a
signal sequence.

[0029] In another preferred embodiment, a nucleic acid of the invention encodes an Ig
fusion protein with mutant or hybrid isotypes. Specifically, the nucleic acid provides
antibody moieties with hybrid isotypes, or alternatively with altered hinge regions.
For example, the fusion protein consists of an IgG2, modified to contain fewer
disulfide bonds in the hinge region, or an IgG2 CH2 and CH3 region in which the
hinge region derives from another antibody, preferably a normal or mutant IgG1
hinge region.

[0030] A nucleic acid of the invention is preferably incorporated in operative
association into a replicable expression vector which is then introduced into a
mammalian host cell competent to produce the fusion protein. The resultant fusion
protein is produced efficiently and secreted from the mammalian host cell. The
secreted fusion protein is subsequently collected from the culture media without
lysing the mammalian host cell. The protein product is assayed for activity and/or
purified using common reagents as desired, and/or cleaved from the fusion partner, all
using conventional techniques.

[0031] Thus, the invention also provides methods for producing fusion proteins with
reduced immunogenicity.
[0032] Methods and compositions of the invention are also useful to provide therapeutic
treatment using a fusion protein that has been rendered less immunogenic. An overall
object of the invention is to provide processes that are both efficient and inexpensive
as well as proteins that are less immunogenic. Preferred therapeutic compositions of
the invention include a therapeutically effective amount of de-immunized fusion
protein. Preferably, the de-immunized fusion protein is administered along with a
pharmaceutically acceptable carrier.

[0033] The foregoing and other aspects, features and advantages of the present invention
will be made more apparent from the detailed description, drawings, and claims that
follow.
Detailed Description of the Invention
[0034] All proteins, including antibodies, that are administered to a patient for
therapeutic use have the potential to induce an immune response in the recipient host.
This immune response is mediated by T-lymphocytes (T-cells) which then trigger
B-lymphocytes (B-cells) to make antibodies. Antibody production against the
therapeutic agent is detrimental since it leads to more rapid elimination of the
therapeutic agent and may induce an allergic response.

[0035] The present invention provides methods of reducing the immunogenicity of
fusion proteins. According to one method of this invention, potential T-cell epitopes
are identified in the junction region of a fusion junction in a fusion protein. T-cell
epitopes are identified by a variety of computer and non-computer methods, including
prediction based on structure-based computer modeling or by synthesis of peptides
and testing for binding to specific MHC Class II molecules or in an immunogenicity
assay.

[0036] According to the invention, a fusion junction is defined as the position
between the last (C-terminal) amino acid of a first protein or peptide and the first
(N-terminal) amino acid of a second protein or peptide in a fusion protein. Accordingly,
a fusion junction includes any amino acids between the last amino acid of one protein
and the first amino acid of a second protein. In one embodiment, the fusion junction
includes a linker.

[0037] According to the invention, a junction region is the region of a fusion protein
surrounding or spanning the fusion junction between two proteins. A junction region
preferably includes between 1 and about 100 amino acids, more preferably between 1
and about 50 amino acids, or between 1 and about 25 amino acids, and even more
preferably between 1 and about 15 amino acids, or between 1 and 9 amino acids. In
one embodiment, a junction region comprises a spacer or linker peptide inserted at the
junction point between the two proteins. According to the invention, a junction
region including a spacer or linker peptide can also be de-immunized to minimize the
response of a patient to a fusion protein including the spacer or linker.

[0038] According to the invention, a junctional T-cell epitope is defined as a peptide
sequence capable of binding an MHC Class II molecule and containing at least one
amino acid derived from each of at least two different fusion partner proteins. For
example, Paul (Fundamental Immunology, Chapter 8, Table 8, p. 276 [2000] 4th ed.)
illustrates segments of 10 amino acids that can bind to an MHC Class II molecule. In a
junctional T-cell epitope, these 10 amino acid peptides are derived from different
fusion partners. According to the invention, a potential or candidate T-cell epitope
that spans a fusion junction (a candidate junctional T-cell epitope) preferably includes
1 to 8 amino acids from either side of the junction, and more preferably 1 to 10 or 1 to
11 amino acids from either side of the junction. Candidate epitopes are preferably 9,
11, or 12 amino acids long. Accordingly, since a junctional T-cell epitope of the
invention includes at least one amino acid from each side of the junction, preferred
candidate T-cell epitopes are junctional epitopes that include 1-8 (or 1-10, or 11)
amino acids from one side of the junction and also include a complementary number
of amino acids from the other side of the junction to result in an epitope having 9-12
amino acids, and most preferably 9 amino acids.
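
The definition of a candidate junctional epitope in paragraph [0038] can be illustrated with a minimal sketch that lists every 9-residue window containing at least one residue from each fusion partner. The function name `junctional_windows` and the C-terminal partner sequence used in the example are assumptions made for illustration only; the sketch enumerates candidates and does not predict MHC Class II binding.

```python
def junctional_windows(n_partner, c_partner, length=9):
    """Yield (n_count, peptide) for each window of `length` residues that
    contains at least one residue from each fusion partner.

    n_count is the number of residues contributed by the N-terminal partner.
    """
    fused = n_partner + c_partner
    junction = len(n_partner)  # index of the first residue of the C-terminal partner
    for n_count in range(1, length):
        start = junction - n_count
        end = start + length
        # Skip windows that would run off either end of the fused sequence.
        if start < 0 or end > len(fused):
            continue
        yield n_count, fused[start:end]


if __name__ == "__main__":
    # KSLSLSPGK is the Fc C-terminal segment quoted in this document;
    # APTSSSTKK is an arbitrary illustrative partner sequence (assumption).
    for n_count, peptide in junctional_windows("KSLSLSPGK", "APTSSSTKK"):
        print(n_count, peptide)
```

For a 9-residue window this yields at most eight candidates, one for each split from 1+8 to 8+1 residues across the junction.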

[0039] According to the invention, anchor residues within a junctional T-cell epitope are
then mutated to prevent binding to an MHC Class II molecule. In general, care is
taken to not introduce additional potential T-cell epitopes, and to preserve the
function of each fusion partner.

[0040] According to the invention, a fusion of wild-type sequences is a fusion in which
the sequences at the N-terminal and C-terminal sides of the fusion junction are
derived directly from naturally occurring sequences.

[0041] According to the invention, a de-immunized fusion junction is a junction sequence
in which one or more substitution mutations have been introduced relative to a
junction of wild-type sequences. In a most preferred embodiment, deimmunization of
a fusion junction does not involve introduction of a linker, such as a
'non-immunogenic' Gly-Ser linker, and the spatial relationship between the fusion partners
is not altered in a de-immunized fusion protein. According to the invention, one or
more amino acids can be substituted or changed in the junction region either
N-terminally to the fusion junction, C-terminally to the fusion junction, or both
N-terminally and C-terminally to the fusion junction.

[0042] According to the invention, a potential T-cell epitope is a sequence that, when
considered as an isolated peptide, is predicted to bind to an MHC Class II molecule or
an equivalent in a non-human species. A potential T-cell epitope is defined without
consideration of other aspects of antigen processing, such as the efficiency of protein
uptake into antigen-presenting cells, the efficiency of cleavage at sites in an intact
protein to yield a peptide that can bind to MHC Class II, and so on. Thus, the set of
T-cell epitopes that are actually presented on MHC Class II after administration of a
protein to an animal is a subset of the potential T-cell epitopes.
[0043] According to the invention, a T-cell epitope is an epitope on a protein
that interacts with an MHC class II molecule. Without wishing to be bound by
theory, it is understood that a T-cell epitope is an amino acid sequence in a protein or
a fusion protein that failed to undergo the negative T-cell selection process during
T-cell development and therefore will be expected to be presented by an MHC Class II
molecule and recognized by a T-cell receptor. In a preferred embodiment of the
invention, the non-self T-cell epitopes are present in the junction region at the fusion
junction of two proteins that form a fusion protein.

[0044] The invention provides non-computer methods for reducing or eliminating the
number of T-cell epitopes in a fusion protein junction without requiring elaborate
computer simulations or protein three-dimensional structures. In one embodiment, a
method of the invention takes advantage of the fact that a core segment of nine amino
acids interacts with both the MHC class II molecule and the T-cell receptor
during antigen presentation. The N-terminal most amino acid is called an "anchor"
position residue that binds to a deep pocket within the MHC class II molecule. One
of the following amino acids is typically present at the anchor position, which is
important for binding to an MHC class II molecule: Leucine, Valine, Isoleucine,
Methionine, Phenylalanine, Tyrosine and Tryptophan. According to the invention, an
additional 2 to 3 amino acids adjacent to the core 9 amino acids also affect the
interaction with MHC molecules. In addition, the C-terminal most amino acid in the
first protein of the fusion protein can generally be substituted. This is useful
especially when the N-terminal fusion partner or first protein is known to be active
when fused to the C-terminal fusion partner or second protein at the C-terminus of the
first protein.

[0045] A general method of the invention includes mutating any Leucines, Valines,
Isoleucines, Methionines, Phenylalanines, Tyrosines or Tryptophans that occur in the
C-terminal most eight amino acids of an N-terminal fusion partner in a fusion protein.
In one embodiment, one or more of these amino acids in a candidate junctional T-cell
epitope is preferentially mutated to a Threonine, an Alanine or a Proline.
This retains some of the hydrophobic nature of the amino acid that is replaced. In
further embodiments of the invention, one or more of the above-mentioned amino
acids is deleted from a candidate or potential junctional T-cell epitope, or replaced
with an appropriate amino acid analog. According to the invention, if an amino acid
is deleted to destroy a potential T-cell epitope, care is taken not to generate a new
T-cell epitope that includes amino acids near the deletion.
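
The non-computer rule of paragraphs [0044] and [0045] can be sketched as a simple scan of the last eight residues of the N-terminal fusion partner for the listed anchor-type residues, followed by substitution with Threonine, Alanine, or Proline. The names `anchor_positions` and `mutate_anchors` are hypothetical, and replacing every hit with a single residue is a simplifying assumption: it does not check for newly created epitopes or for loss of function, both of which the text says must be considered.

```python
# Residues named in the text as typical MHC Class II anchor residues.
ANCHOR_RESIDUES = frozenset("LVIMFYW")
# Replacement residues the text describes as preferred (Thr, Ala, Pro).
PREFERRED_REPLACEMENTS = frozenset("TAP")


def anchor_positions(n_partner, tail_length=8):
    """Return 0-based indices of anchor-type residues in the C-terminal tail."""
    start = max(0, len(n_partner) - tail_length)
    return [i for i in range(start, len(n_partner))
            if n_partner[i] in ANCHOR_RESIDUES]


def mutate_anchors(n_partner, replacement="A", tail_length=8):
    """Replace every anchor-type residue in the tail with `replacement`."""
    if replacement not in PREFERRED_REPLACEMENTS:
        raise ValueError("replacement must be one of T, A, or P")
    positions = set(anchor_positions(n_partner, tail_length))
    return "".join(replacement if i in positions else residue
                   for i, residue in enumerate(n_partner))


if __name__ == "__main__":
    tail = "KSLSLSPGK"  # Fc C-terminal segment quoted in this document
    print(anchor_positions(tail))
    print(mutate_anchors(tail, "A"))
```

The scan only flags candidate positions; the sequences actually disclosed in this document also change residues adjacent to the leucines, so this sketch should not be read as reproducing them.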
[0046] According to the invention, it is often useful to construct a generalized expression
plasmid construction intermediate comprising the coding sequence for an N-terminal
fusion partner containing a mutation of one or more hydrophobic residues in the last
eight amino acids. Generally, such a plasmid has one or more convenient restriction
enzyme sites at or near the DNA encoding the C-terminus of the N-terminal fusion
partner.

[0047] The purpose of a plasmid construction intermediate is to construct expression
plasmids encoding a fusion protein in which one or more N-terminal fusion partners
has one or more substitutions of a Leucine, Valine, Isoleucine, Methionine,
Phenylalanine, Tyrosine, or Tryptophan to another amino acid in the eight C-terminal
amino acids. The construction of such final expression plasmids may be
accomplished by a variety of other methods well known in the art, such as generation
of PCR fragments or synthetic nucleic acids, followed by ligation of the fragment into
an appropriate vector or attachment with other sequences through well-known PCR
techniques.

[0048] Specific preferred embodiments include Fc-X fusion plasmids, albumin-X fusion
plasmids, scFv-X fusion plasmids, and Fab-X fusion plasmids. In the Fc(gamma)-X
case, it is useful to introduce mutations into the coding sequence to bring about amino
acid substitutions of the Leucine-Serine-Leucine-Serine segment near the C-terminus of the
Fc region of an IgG1, IgG2, IgG3, or IgG4 molecule, as diagrammed here for IgG1:
Amino acid sequences of human Fc regions derived from IgG1, IgG2, IgG3 and IgG4
are depicted in SEQ ID NOs: 1, 2, 3 and 4 respectively.

[0049] In one example, KSLSLSPGK (SEQ ID NO: 5) is changed to KSATATPGK
(SEQ ID NO: 6). This mutation is designed to eliminate potential junctional T-cell
epitopes and also remove a T-cell epitope in which the upstream Phenylalanine or
Tyrosine serves as a position 1 anchor residue.

[0050] Alternatively, it is sometimes useful to combine mutations that remove
candidate junctional T-cell epitopes with a mutation that extends the serum half-life.
For example, KSLSLSPGK (SEQ ID NO: 5) is changed to KSATATPGA (SEQ ID
NO: 7).

[0051] Other embodiments include substitutions in the LSLS segment to other
amino acids such as Glycine or Proline.
[0052] In the case of expression vectors used for making IgA fusion proteins, it is useful
to delete some of the C-terminal amino acids, so that the cysteine near the C-terminus
that is involved in oligomerization of IgA is deleted. For example, fifteen amino
acids can be deleted, such that the IgA heavy chain sequence ends with
Proline-Threonine-Histidine before being fused to a second protein. In addition, it is useful to
introduce the following changes near the C-terminus of the CH3 domain of the IgA Fc
region:

QKTIDRLAGKPTH (SEQ ID NO: 8) changed to QKTADRTAGKPTH (SEQ ID NO: 9)

[0053] Additional de-immunized sequences in an IgA-X fusion protein are:
QKTPTRTAGKPTH (SEQ ID NO: 10)
QKTPTRPAGKPTH (SEQ ID NO: 11)
QKTATRPAGKPTH (SEQ ID NO: 12).

[0054] In the case of an albumin-X fusion, it is useful to introduce the following
changes in an albumin-X expression plasmid such that the C-terminus of albumin is
modified as follows:

KKLVAASQAALGL (SEQ ID NO: 13) changed to KKLVAASQAATTA (SEQ ID NO:
14).
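
To make the sequence changes above easier to read, the following sketch lists the point substitutions between a wild-type segment and its de-immunized counterpart. The helper name `substitutions` is hypothetical, and positions are numbered within the quoted segment only (starting at 1), not within the full-length protein.

```python
def substitutions(wild_type, mutant):
    """List point substitutions as strings such as 'I4A' (wild-type residue,
    1-based position within the segment, mutant residue)."""
    if len(wild_type) != len(mutant):
        raise ValueError("sequences must have the same length")
    return [f"{w}{i}{m}"
            for i, (w, m) in enumerate(zip(wild_type, mutant), start=1)
            if w != m]


if __name__ == "__main__":
    # IgA CH3 C-terminal segment: SEQ ID NO: 8 versus SEQ ID NO: 9.
    print(substitutions("QKTIDRLAGKPTH", "QKTADRTAGKPTH"))
    # Albumin C-terminal segment: SEQ ID NO: 13 versus SEQ ID NO: 14.
    print(substitutions("KKLVAASQAALGL", "KKLVAASQAATTA"))
```

In both pairs the changed positions include Isoleucine or Leucine residues, which belong to the anchor-type residues listed earlier in the description.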

[0055] Thus, the invention provides nucleic acid sequences and proteins that are
useful in construction of less immunogenic fusion proteins. Specifically, the
invention provides proteins with mutations of any Leucines, Valines, Isoleucines,
Methionines, Phenylalanines, Tyrosines, or Tryptophans in the last eight amino acids.
The proteins are preferably human proteins with sequences that generally correspond
to sequences found in the human body. The invention also provides nucleic acid
sequences encoding such proteins. The nucleic acid sequences for this aspect of the
invention may exist as plasmids, PCR-generated fragments, or nucleic acids produced
by chemical synthesis.

[0056] The invention also provides expression plasmids encoding a fusion protein in
which one or more N-terminal fusion partners has one or more mutations of a
Leucine, Valine, Isoleucine, Methionine, Phenylalanine, Tyrosine, or Tryptophan to
another amino acid in the eight C-terminal amino acids.

[0057] For example, plasmids encoding an Fc-IL2 or whole-antibody-IL2 fusion
protein in which the Fc region is mutated as described above are provided by the
invention. In addition, fusions comprising an Fc region mutated as described above to
normal or mutated forms of erythropoietin, such as the forms of erythropoietin described
in WO01/36489, are provided by the invention.

[0058] The invention also provides a method for reducing immunogenicity of a fusion
protein junction by introducing an N-linked or O-linked glycosylation site near, or
preferably, at a fusion junction. For example, the amino acids Asparagine, Serine or
Threonine, and a third residue are introduced as follows. Consider a sequence in
which X's represent amino acids of an N-terminal fusion partner, and Z's represent
amino acids of a C-terminal fusion partner.

X1X2X3X4X5NG S Z3Z4Z5Z6Z7Z8Z9 

[0059] According to this method, binding of a junction peptide is not necessarily
blocked by introduction of the glycosylation site. However, any peptide that is bound
in the MHC Class II groove and has the glycosylated asparagine C-terminal to the
N-terminal-most anchor residue will not function as a T-cell epitope. The presence of
the large glycosylation moiety will sterically hinder recognition of the MHC Class
II/peptide complex. A preferred glycosylation site includes the sequence Asn-X-Ser
or Asn-X-Thr, wherein X is preferably Gly, but can be any amino acid.
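
As a non-limiting illustration, a junction sequence can be scanned for the Asn-X-Ser/Thr motif with a short script. The Python sketch below is an assumption-laden aid rather than part of the disclosure: the name `find_sequons` is hypothetical, and the pattern follows the wording above in allowing any residue at X, although in practice Proline at X is generally considered incompatible with N-glycosylation.

```python
import re

# Asn-X-Ser/Thr as worded in the text; X is allowed to be any residue here.
# The lookahead lets overlapping motifs be reported as well.
SEQUON = re.compile(r"(?=(N[A-Z][ST]))")


def find_sequons(sequence: str):
    """Return (1-based position of the Asn, three-residue motif) for each match."""
    cleaned = sequence.upper().replace("-", "").replace(" ", "")
    return [(m.start() + 1, m.group(1)) for m in SEQUON.finditer(cleaned)]


if __name__ == "__main__":
    # Junction fragments quoted elsewhere in the specification.
    print(find_sequons("LSLSPGK-AP"))  # unmodified junction (SEQ ID NO: 17)
    print(find_sequons("LNLSPGA-AP"))  # junction with introduced Asn (SEQ ID NO: 19)
```

A match indicates only that the motif is present; whether the site is actually glycosylated depends on the expression host and on the local structure.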
[0060] Furthermore, the introduction of mutations introducing Glycine and Serine
residues does not create new T-cell epitopes. Neither Glycine nor Serine can act as an
anchor residue. During antigen processing, a fusion protein, in principle, is cleaved
between the glycosylated Asparagine and the Glycine or between the Glycine and the
Serine. In either case, the resulting peptides have the mutant Glycine and/or Serine
residues N-terminal to an anchor residue, and thus the mutant Glycine and/or Serine
residues are not recognized by a T cell receptor, since residues N-terminal to an
anchor residue are outside the region recognized by the TCR.
[0061] In a variation of this method, a fusion junction region already contains a
Serine or Threonine preceded by an amino acid residue such as Glycine, Serine,
Alanine, etc. The second method is preferably used when a junction region is flexible
and displaced from the hydrophobic core of each fusion partner, so that the novel
N-linked glycosylation does not interfere with the folding or function of either fusion
partner.
[0062] It is a straightforward matter for those skilled in the art of protein engineering
to determine when introduction of a glycosylation site is feasible. For example, the
three-dimensional structure of each fusion partner, or close homologs of the fusion
partners, may be known. It is often the case that a few amino acids at the N-terminus
or C-terminus of a protein are not resolved in an X-ray structure, or exhibit many
possible conformations in an NMR structure. In cases where three or more amino
acids are disordered on either side of a glycosylation site, there is some confidence
that the resulting fusion protein will fold correctly and both partners will be active.
Some routine experimentation is necessary to determine whether a given fusion
protein construct will be functional.

[0063] In preferred embodiments of the invention, both the N-terminal and the
C-terminal partner of the fusion protein are human proteins. Potential T-cell epitopes in
such fusion proteins are created from the final 8 amino acids of the N-terminal partner
(first protein) combined with the first 8 amino acids of the C-terminal partner (second
protein). This provides a series of 8 hybrid 9-mers created from the first and second
proteins. Any aliphatic or aromatic residue (Leucine, Valine, Isoleucine, Methionine,
Phenylalanine, Tryptophan or Tyrosine) in the last 8 amino acids of the first protein
presents a high risk of creating an MHC binding peptide with the amino acid in the
first position (anchor position) that binds the pocket of the MHC molecule.
Therefore, substitution of any of the above-mentioned amino acids, with an amino
acid that is not one of the above-mentioned amino acids, and preferably with Alanine,
Proline or Threonine, will remove a candidate T-cell epitope.

[0064] For example, in the case of an Fc fusion protein containing the sequence

HNHYTQKSLSLSPGKGGGGSGGGGSGGGGS (SEQ ID NO: 15),

the leucine residues create two potential epitopes. Therefore, the sequence can be
de-immunized as

HNHYTQKSATATPGKGGGGSGGGGSGGGGS (SEQ ID NO: 16),

by changing L to A and S to T. These changes remove epitopes with Leucine as the
first amino acid in the MHC binding pocket and Tyrosine as the first amino acid in the
MHC binding pocket, respectively.
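
The enumeration of hybrid 9-mers described in paragraph [0063] and the example of paragraph [0064] can be sketched in Python as follows. This is a minimal illustration under stated assumptions, not the peptide threading method itself: the function names are hypothetical, the split of SEQ ID NOs: 15 and 16 into an Fc part and a linker part is made here for illustration, and the only criterion applied is whether a 9-mer begins with one of the seven residues the text names as likely anchors.

```python
from typing import List, Tuple

# Residues the text lists as high-risk position 1 anchors.
ANCHOR_RESIDUES = set("LVIMFWY")


def junction_ninemers(first: str, second: str, flank: int = 8, size: int = 9) -> List[str]:
    """Return the peptides of length `size` taken from the last `flank` residues of
    `first` joined to the first `flank` residues of `second`. With the defaults
    (8 and 9) every peptide contains residues from both partners."""
    window = first[-flank:] + second[:flank]
    return [window[i:i + size] for i in range(len(window) - size + 1)]


def anchor_candidates(first: str, second: str) -> List[Tuple[str, bool]]:
    """Pair each hybrid 9-mer with a flag: does it start with an anchor residue?"""
    return [(pep, pep[0] in ANCHOR_RESIDUES) for pep in junction_ninemers(first, second)]


if __name__ == "__main__":
    linker = "GGGGSGGGGSGGGGS"
    for fc_tail in ("HNHYTQKSLSLSPGK", "HNHYTQKSATATPGK"):
        flagged = [pep for pep, hit in anchor_candidates(fc_tail, linker) if hit]
        print(fc_tail, len(junction_ninemers(fc_tail, linker)), flagged)
```

A flagged 9-mer is only a candidate for closer analysis; actual MHC binding potential would still have to be assessed, for example by peptide threading as described in the Examples.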
[0065] These substitutions for deimmunization work in humans for all Fc fusion
proteins, both with and without linker sequences, preferably when 1) both proteins in
the fusion protein are human proteins; 2) the MHC binding peptides in the natural
sequences of both proteins are ignored; and 3) the 9-mers identical to the original
sequences are also ignored.

[0066] Methods of the invention are generally applicable in all vertebrate organisms,
preferably in mammals and most preferably in humans. The invention is illustrated
further by the following non-limiting examples.

Examples 

Example 1: Deduction of immunogenic reactive epitopes of huKS-IL2 immunocytokine.

[0067] HuKS-IL2 consists of humanized VH and VL regions combined with human H
and L chain constant regions. The H chain was fused at its carboxyl terminus to the
mature sequence of human IL-2 as described previously. This H chain is of the γ1
isotype and has high affinity for Fc receptors. Because of this high affinity, HuKS-IL2
was cleared quickly from the circulation. Without wishing to be bound by theory, the
clearance of HuKS-IL2 presumably occurs via FcR-bearing cells in the liver (Kupffer
cells) and spleen (antigen presenting cells).

[0068] It was previously established that certain patients had made immune responses
to some portion of the huKS-IL2 molecule; however, the epitopes recognized by these
antibodies are not known. To deduce the reactive epitopes, relative reactivities of
patient sera with huKS-IL2 were compared to other related proteins:

(1) Hu14.18-IL2, a molecule with completely different humanized V regions
but exactly the same C regions and fusion junction with IL-2;

(2) VH1, a de-immunized form of huKS-IL2 with no T-cell epitopes in the VH
and VL regions, derived from mouse V regions with surface-exposed mouse B-cell
epitopes veneered to human residues;

(3) VH2, a de-immunized form of huKS-IL2 with one remaining T-cell
epitope in CDR3, derived from mouse V regions with surface-exposed mouse B-cell
epitopes veneered to human residues, in which the VH contains one T-cell epitope;

(4) 425-IL2 constructed with either KOL or EU Cγ1 regions (rather than KS)
(to compare allotypic reactivity);

(5) huKS-mIL2, a molecule with the huKS V regions fused to mouse C
regions and mouse IL-2;

(6) human Fc-IL2;

(7) human Fc only;

(8) human IL-2 only.

[0069] Immunoglobulin fusion proteins and fragments were purified by protein A
Sepharose chromatography and were coated on 96-well plates in bicarbonate buffer
and then blocked with 1% goat serum containing 1% BSA. Dilutions of patient sera
were incubated and then unbound material was removed by three washes with
PBS-Tween. Bound human antibodies from the patient sera were detected with various
HRP-conjugated antibodies depending on the bound protein. Generally, goat
anti-human λ chain HRP conjugate was used because most of the plate-bound proteins
consisted of human Fc and human κ chains.

[0070] Certain patient sera showed a clear reactivity to huKS-IL2 that was not
detectable in pre-injection sera from the same patients. Preimmune antisera were used
to establish a baseline non-immunized control. Reactivity seen in patient sera can be
attributed to (1) anti-IL2 reactivity, (2) anti-Fc (allotypic) reactivity, (3) reactivity to
the novel junction sequence or (4) anti-idiotypic reactivity with the KS idiotype, or a
combination of reactivities.

[0071] No patient serum reacted significantly with recombinant IL-2 or to
the Fc region (1 and 2 above). Some patients showed anti-idiotypic reactivity to the
KS V regions. All patient sera showed reactivity with Fc-IL2. Three of four patients
showed reactivity to Fc-IL2. The presence of reactivity against Fc-IL2 but not against
either Fc or IL2 suggests that the junction between Fc and IL2 was recognized by the
patients' anti-sera.

Example 2: Modification of amino acid residues at the junction of an antibody-cytokine
fusion protein to reduce immunogenicity by elimination of MHC Class II

[0072] Peptide threading analysis identified two overlapping peptide segments with
strong MHC binding potential at the junction between the Fc and IL2 portion of the
immunocytokine. The peptide threading and identification of potential T-cell
epitopes was performed as disclosed in Carr (WO00/34317). Amino acid changes
were introduced such that the existing potential MHC Class II binding epitopes were
eliminated, but new potential MHC Class II epitopes were not introduced.

[0073] Modification of a junction sequence LSLSPGK-AP (SEQ ID NO: 17) to
ATATPGA-AP (SEQ ID NO: 18) ("LSLS to ATAT"), where the hyphen is the
immunocytokine huKS-IL2 junction, made junction-derived peptide sequences
incapable of binding to any human MHC Class II with an affinity high enough to
result in immunogenicity.

Example 3: Modification of amino acid residues at the junction of immunocytokine
fusion proteins to reduce immunogenicity

[0074] Modification of a junction sequence LSLSPGK-AP (SEQ ID NO: 17) to
LNLSPGA-AP (SEQ ID NO: 19) ("LSLS to LNLS"), where the hyphen is the
immunocytokine huKS-IL2 junction, results in junction-derived peptide sequences
that are still capable of binding to certain MHC Class II molecules. However, when
the KS-IL2 protein is expressed in mammalian cells and secreted, the protein is
N-glycosylated near the junction because of the NXS/T sequence.

[0075] The resulting junction-derived peptides are not effective as T-cell epitopes,
because when the junction-derived peptides are presented to T cells by MHC Class II,
the large N-glycosylation moiety prevents specific docking between a T cell-receptor
and MHC Class II.

Example 4: Characterization of immune reactivity of antigen presenting cells to
immunocytokine huKS-IL2 in comparison to a de-immunized huKS-IL2
immunocytokine.

[0076] Reduction of immunogenicity due to modification of the reactive epitope by
mutating LSLS to ATAT is directly tested as follows. Synthetic peptides mimicking
this sequence alter the immune response of a classic antigen presenting cell such as a
dendritic cell (DC). The following synthetic peptides

KSLSLSPGK-APTS (SEQ ID NO: 20) and
KSATATPGK-APTS (SEQ ID NO: 21),

where the hyphen is the KS-IL2 junction, are used to stimulate DC-mediated antigen
presentation to autologous T cells. The ability of those T cells to proliferate in
response to a subsequent challenge with the peptide antigen serves as a measure of
immunogenicity of that peptide.

[0077] Specifically, peripheral blood mononuclear cells (PBMC) are isolated from
leukopacks by standard density gradient techniques. Mononuclear cells are
resuspended in serum-free Aim V culture media and allowed to adhere. After 2 h at
37 °C nonadherent cells are removed. Adherent cells are cultured for 7 days in media
containing human GM-CSF (50 ng/ml) and IL-4 (20 ng/ml) to derive immature
dendritic cells (DC). After 7 days, the cells are harvested and phenotypically
characterized by flow cytometry with appropriate FITC-labeled Abs for MHC class I,
MHC class II, CD80 and CD40 to confirm the immature DC phenotype.

[0078] Non-adherent cells are cultured with IL2 and IL7 to obtain autologous effector
cells (T-cells) to be used in subsequent functional studies. For functional studies, T-cells
are added to immature dendritic cells (10:1 ratio) and co-cultured with huKS,
de-immunized huKS, peptide junction 13-mer (KSLSLSPGK-APTS) (SEQ ID NO: 20)
and the modified, de-immunized 13-mer peptide (KSATATPGK-APTS) (SEQ ID NO:
21). Comparison of the proliferation index, as measured by tritiated thymidine
incorporation after exposure to each of the immunocytokines or immunogenic and
modified de-immunized peptides, demonstrates the degree of immunogenicity of each
molecule. Namely, an increase in radioactive incorporation is roughly proportional to the
ability of each peptide to bind to a class II MHC molecule on DC and be presented to
T cells.
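
The text does not define how the proliferation index is calculated. One common convention, assumed here purely for illustration, is the ratio of mean thymidine incorporation (counts per minute) in antigen-exposed wells to that in wells without antigen. The Python sketch below follows that assumption; the function names are hypothetical and no measured data are implied.

```python
from statistics import mean
from typing import Dict, List, Sequence, Tuple


def stimulation_index(stimulated_cpm: Sequence[float], control_cpm: Sequence[float]) -> float:
    """Mean cpm of antigen-exposed wells divided by mean cpm of no-antigen wells.
    This definition is an assumption; the text does not specify one."""
    baseline = mean(control_cpm)
    if baseline <= 0:
        raise ValueError("control counts must be positive")
    return mean(stimulated_cpm) / baseline


def rank_antigens(
    readings: Dict[str, Sequence[float]], control_cpm: Sequence[float]
) -> List[Tuple[str, float]]:
    """Sort antigens from highest to lowest index."""
    scores = {name: stimulation_index(cpm, control_cpm) for name, cpm in readings.items()}
    return sorted(scores.items(), key=lambda item: item[1], reverse=True)
```

Under this convention, a de-immunized peptide would be expected to give an index closer to 1 than its unmodified counterpart when both are tested against the same no-antigen control.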

Example 5: Deduction of immunogenic reactive epitopes found in albumin fusion
proteins and modification of amino acid residues at a fusion junction to reduce

[0079] Human serum albumin (HSA), due to its remarkably long half-life, its wide in
vivo distribution and its lack of enzymatic or immunological functions, has been used
as a carrier for therapeutic peptides/proteins. A genetically engineered HSA-CD4
hybrid has been shown to block the entry of the human immunodeficiency virus into
CD4+ cells while exhibiting antiviral in vitro properties similar to those of soluble
CD4 (Yeh et al., PNAS 89:1904-1908, 1992). Thus, the genetic fusion of bioactive
peptides to HSA is useful for designing and recovering secreted therapeutic HSA
derivatives. However, as with all fusion proteins, HSA-CD4 has a novel junction
which can be immunogenic and contains T-cell epitopes capable of being presented
on MHC class II molecules. Analysis of the junction between HSA and CD4 using
the methods of Examples 1, 2, 3, and 4 identifies peptides with MHC binding
potential. The potentially immunogenic sequences are modified to decrease or
eliminate potential T and B-cell epitopes in order to reduce immunogenicity.
Similarly, a novel glycosylation site can be introduced into the junction region in
order to reduce immunogenicity.

Albumin sequence CD4 sequence 

TCFAEEGKKLVAASQAALGL-KKVVLGKKGDTVELTC^ (SEQ ID NO: 22).

[0080] It is contemplated by the invention that the HSA-IFNalpha fusion protein junction 
region contains three candidate T-cell epitopes, 
KKLVAASQAALGL (SEQ ID NO: 13); 
KLVAASQAALGLC (SEQ ID NO: 23); and
LGLCDLPQTHSLG (SEQ ID NO: 24). 

[0081] The T-cell epitopes depicted in SEQ ID NOs: 13 and 23 overlap and can be
de-immunized by changing LV (in bold) to anything except F, I, L, M, V, W and Y.
Alternatively, the peptide threading score can be reduced significantly by changing
LG to TT. The T-cell epitope in SEQ ID NO: 24 can be de-immunized by changing
the second L (in bold) to an A.

[0082] Furthermore, it is contemplated that in the case of an HSA-X fusion, wherein X
can be any protein, deimmunization of the fusion junction is achieved by changing the
amino acid sequence AALGL (SEQ ID NO: 25) to TATTA (SEQ ID NO: 26).

CFAEEGKKLVAASQTATTA (SEQ ID NO: 27).

Example 6: X-Fc fusion proteins and modification of amino acid residues at a fusion

[0083] In some instances it is specifically advantageous to engineer a fusion protein
in the X-Fc orientation. With these constructs, a target protein is an N-terminal fusion
protein and an Fc fragment follows. For example, the glucagon-like peptide (GLP-1)
requires a free N-terminus for its activity, so a GLP-1-Fc fusion is useful.

[0084] A GLP-1-Fc fusion protein is constructed according to standard techniques
described in the art. This fusion protein has the C-terminus of GLP-1 joined to the
hinge of the γ1 heavy chain. The γ1 hinge sequence containing a Cys to Ser mutation
(residue 5), which eliminates the Cys residue that forms a disulphide bond with the
light chain in IgG1 (Lo et al., (1998) Protein Engineering 11:495-500), is used. The
non-mutant Fc sequence is

EPKSCDKTHTCPPCPA PELLG (SEQ ID NO: 28)

with the hinge region being underlined, followed by the start of the CH2 domain
sequence.

[0085] The fusion junction between GLP-1 (7-37) and mutant Fc is: 
HAEGTFTSDVSSYLEGQAAKEFIAWLVKGRG - 
EPKSSDKTHTCPPCPAPELLG (SEQ ID NO: 29). 
[0086] The fusion junction between GLP-1 (7-37) and normal Fc is: 
SYLEGQAAKEFIAWLVKGRG - EPKSCDKTHTCPPCPAPELLG (SEQ ID NO:
30).

[0087] Three potential epitopes are identified by peptide threading at the GLP-1-Fc
fusion junction.
KEFIAWLVKGRGE (SEQ ID NO: 31)
EFIAWLVKGRGEP (SEQ ID NO: 32)
AWLVKGRGEPKSS (SEQ ID NO: 33).

[0088] Analysis of fusion junctions between GLP-1 (bold text) and Fc (plain text),
performed as in Examples 1-3, identifies peptides with MHC binding potential. After
identification of potential sites by peptide threading analysis, the potentially
immunogenic sequences are modified by amino acid substitution to
eliminate potential T and B-cell binding epitopes and decrease immunogenicity.

[0089] The above-mentioned potential T-cell epitopes depicted in SEQ ID NOs: 31,
32 and 33 are de-immunized by making single amino acid substitutions. For example,
the peptide shown in SEQ ID NO: 31 is de-immunized by changing the Lysine (shown in
bold) to a Threonine and the Arginine (shown in bold) to a Threonine. The peptide
shown in SEQ ID NO: 32 is de-immunized by replacing the Isoleucine (shown in
bold) with an Alanine or a Proline and the peptide in SEQ ID NO: 33 is
de-immunized by replacing the Leucine with an Alanine or a Proline. The resulting
de-immunized junction is:

HAEGTFTSDVSSYLEGQAAKEFAAWAVTGTG - EPKSSDKTHTCPPCPAPELLG
(SEQ ID NO: 34).
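
For illustration, the substitutions described in paragraph [0089] can be applied programmatically to the GLP-1 (7-37) portion of SEQ ID NO: 29 to confirm that they yield the GLP-1 portion of SEQ ID NO: 34, using Alanine for the Isoleucine and Leucine replacements. In the Python sketch below the function name and the substitution notation are assumptions, and positions are counted within the 31-residue fragment as written (position 1 is the N-terminal Histidine), not according to any published GLP-1 numbering.

```python
import re
from typing import Iterable


def apply_substitutions(sequence: str, substitutions: Iterable[str]) -> str:
    """Apply substitutions written as <old><1-based position><new>, e.g. 'I23A'.
    Each substitution is checked against the residue currently at that position."""
    residues = list(sequence)
    for sub in substitutions:
        match = re.fullmatch(r"([A-Z])(\d+)([A-Z])", sub)
        if match is None:
            raise ValueError(f"cannot parse substitution {sub!r}")
        old, pos, new = match.group(1), int(match.group(2)), match.group(3)
        if not 1 <= pos <= len(residues) or residues[pos - 1] != old:
            raise ValueError(f"{sub}: expected {old} at position {pos}")
        residues[pos - 1] = new
    return "".join(residues)


# GLP-1 (7-37) portion of the junction shown in SEQ ID NO: 29.
GLP1_7_37 = "HAEGTFTSDVSSYLEGQAAKEFIAWLVKGRG"

# Isoleucine and Leucine to Alanine, Lysine and Arginine to Threonine.
DEIMMUNIZED_GLP1 = apply_substitutions(GLP1_7_37, ["I23A", "L26A", "K28T", "R30T"])

if __name__ == "__main__":
    print(DEIMMUNIZED_GLP1)
```

The check on the original residue guards against applying a substitution at the wrong position, which is easy to do when several numbering schemes are in use.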

[0090] According to an exemplary method for introducing a glycosylation site at a
fusion junction the following changes are introduced:

SYLEGQAAKEFIAWLVKGRN - GSKSSDKTHTCPPCPAPELLG (SEQ ID NO: 
35). 

Example 7: Deduction of immunogenic reactive epitopes of Enbrel, a TNFR-Fc
fusion protein and modification of amino acid residues at a fusion junction to reduce
immunogenicity.

[0091] ENBREL or etanercept, an X-Fc fusion protein approved by the FDA, is a
tumor necrosis factor (TNF) inhibitor used to treat di^^ ENBREL is a
dimeric fusion protein consisting of an extracellular ligand-binding domain of TNF
receptor linked to an Fc protein of human IgG1. TNFR-Fc competitively inhibits
binding of TNF to its receptor and renders the bound TNF biologically inactive,
resulting in significant reduction in inflammatory activity. As described above for
GLP-1-Fc, TNFR-Fc has a novel junction which contains potential T-cell epitopes.

[0092] The junction between a direct fusion of a C-terminus portion of TNF-R (bold
text) to the N-terminus of the γ1 hinge (plain text with the underline region
representing the hinge region) is

STSFLLPMGPSPPAEGSTGD - EPKSCDKTHTCPPCPA PELLG (SEQ ID NO:
36)

[0093] Analysis of a junction between TNF-R and Fc, performed as in Examples 1-4,
identifies peptides with MHC binding potential. After identification of potential sites
by peptide threading analysis, the potentially immunogenic sequences are modified by
amino acid substitution to reduce or eliminate potential T and B-cell binding epitopes
and decrease immunogenicity.

[0094] According to an exemplary method for introducing a glycosylation site at a
fusion junction the following changes are introduced:

STSFLLPMGPSPPAEGSTGN - GSESOTKTHTCPPCPAPELLG (SEQ ID NO: 
37). 

Example 8: Deduction of immunogenic reactive epitopes for Fc-X-Y fusion proteins
such as Fc-IL12-IL2 and modification of amino acid residues at the fusion junction to
reduce immunogenicity.

[0095] Fusion proteins of an Fc-X-Y orientation such as Fc-IL12-IL2 have multiple
novel fusion junctions which are potentially immunogenic. For instance, Fc-IL12 has a
fusion junction similar to other Fc-X fusion proteins or immunocytokines (Example
1) but is novel due to the usage of the cytokine IL12. The fusion junction is analyzed
for immunogenic binding sites and modified accordingly. Secondly, there is an X-Y
fusion junction comparable to that described in Example 5, with two different
cytokines constituting a fusion protein. Peptide thread analysis is used for each of the
fusion junctions.

[0096] Analysis of the junctions: 

(1) MHEALHNHYTQKSLSLSPGK-KNIJVATPDP^^ (SEQ ID NO: 38)

between the C-terminus of Fc (bold text) and the N-terminus of IL12p35 (plain text),
and

(2) RAQDRYYSSSWSEWASVPCS - APTSSSTKKTQLQLEHLLLD (SEQ ID NO: 39)

between the C-terminus of IL12p40 (bold text) and the N-terminus of IL2 (plain text)
by peptide threading identifies peptides with MHC binding potential. The potentially
immunogenic sequences are modified to decrease or eliminate potential T-cell
epitopes.

[0097] For example, in sequence (1) above, the following changes are made:

MHEALHNHYTQKSA1\41T (SEQ ID NO: 40). 

[0098] These changes reduce or eliminate MHC Class II-binding potential of several
T cell epitopes at a junction of Fc and the p35 subunit of IL12.
[0099] In another example, sequence (2) above is modified to introduce a
glycosylation site by introducing an Asparagine and Glycine at the first two positions
within IL-2. This strategy uses the naturally occurring Threonine at position 3 of mature
IL-2. In addition, it is important to not disrupt the formation of a disulfide bond in the
p40 moiety, so it is useful to separate the glycosylation site by at least one or two amino
acids from the Cysteine in p40.

RAQDRYYSSSWSEWASVPCS - NGTSSSTKKTQLQLEHLLLD (SEQ ID NO:
41).
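
The spacing consideration in paragraph [0099] can be checked with a few lines of code. The Python sketch below is illustrative only; the function name is hypothetical, and it simply counts the residues lying between the last Cysteine of the N-terminal partner and the Asparagine of the first Asn-X-Ser/Thr motif that follows it in the fused sequence.

```python
def residues_between_cys_and_sequon(first: str, second: str) -> int:
    """Count residues separating the last Cys of `first` from the Asn of the
    first Asn-X-Ser/Thr motif found downstream in the fused sequence."""
    fused = first + second
    cys = first.rfind("C")
    if cys < 0:
        raise ValueError("no Cysteine in the N-terminal partner")
    for i in range(cys + 1, len(fused) - 2):
        if fused[i] == "N" and fused[i + 2] in "ST":
            return i - cys - 1
    raise ValueError("no Asn-X-Ser/Thr motif downstream of the Cysteine")


if __name__ == "__main__":
    # p40 C-terminal fragment and modified IL-2 N-terminal fragment (SEQ ID NO: 41).
    p40_tail = "RAQDRYYSSSWSEWASVPCS"
    il2_head = "NGTSSSTKKTQLQLEHLLLD"
    print(residues_between_cys_and_sequon(p40_tail, il2_head))
```

For the junction of SEQ ID NO: 41, the single Serine that ends the p40 fragment is the only residue between the Cysteine and the introduced Asparagine.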

[0100] In the case of the IL12p40-IL2 fusion, introduction of a glycosylation site as
discussed above creates the following potential T-cell epitopes.
SEWASVPCSNGTS (SEQ ID NO: 42)
ASVPCSNGTSSST (SEQ ID NO: 43)

[0101] However, glycosylation of the T-cell epitope prevents MHC Class II binding thus
resulting in reduced immunogenicity.

Example 9: Deduction of immunogenic reactive epitopes in junction of an X-Fc-Y
fusion protein and modification of amino acid residues at a fusion junction to reduce
MHC Class II binding.

[0102] Fusion proteins of the X-Fc-Y configuration, such as IL4-Fc-GMCSF, have
multiple novel fusion junctions that contain potential T-cell epitopes. The IL4-Fc is a
junction analogous to other X-Fc fusion proteins (Examples 6 and 7) but is novel due to
the use of the cytokine IL4. For example, a form of Fc using a hinge region, CH2, and
CH3 domain from human γ1 is used. As stated above, a γ1 hinge sequence in
pdCs-huFcγ1 may contain a Cys to Ser mutation (underlined) that eliminates the Cys
residue that forms a disulphide bond with a light chain in IgG1 (Lo et al., (1998) Protein
Engineering 11:495-500), thereby creating a third potentially immunogenic fusion
junction for analysis. The fusion junction is analyzed for potential T-cell epitopes and
modified according to the methods of Examples 1-4.

[0103] There is an Fc-Y fusion junction comparable to that described in Example 1 for
the immunocytokine huKS-IL2, with a different cytokine GMCSF constituting a fusion
protein. This fusion junction is also analyzed for potential T-cell epitopes and modified
according to the methods of Examples 1-4.
Specifically, analysis of the junctions

(1) ENFLERLKTIMREKYSKCSS - epkscdkthtcppcpapellg (SEQ ID NO: 44)

between the C-terminus of IL4 (bold text) and the N-terminus of Fc (plain text), and

(2) MHEALHNHYTQKSLSLSPGK-parspspstqpwehvnaiqe (SEQ ID NO: 45)

between the C-terminus of Fc (bold text) and the N-terminus of GMCSF (plain text) by
peptide threading identifies peptides with MHC binding potential. The potential T-cell
epitopes are modified to decrease or eliminate potential T epitopes in order to reduce
immunogenicity.

[0104] A candidate T-cell epitope at the junction of IL4-Fc fusion protein is
EKYSKCSSEPKSC (SEQ ID NO: 46),

where changing E (in bold) to T reduces the peptide threading score or the MHC Class II
binding potential significantly. The sequence of the modified IL4-Fc fusion is as follows:

ENFLERLKTIMREKYSKCSS - tpkscdkthtcppcpapellg (SEQ ID NO: 47).

[0105] The Fc-GMCSF fusion junction is de-immunized by changing the sequence LSLS
to ATAT as shown below.

MHEALHNHYTQKSATATPGK-parspspstqpwehvnaiqe (SEQ ID NO: 48).

Example 10: Modification of amino acid residues at a fusion junction of
immunocytokines and immunofusins prepared with a hybrid isotype to remove T-cell
epitopes.

[0106] It is often useful to construct an antibody or antibody-based fusion protein with a
hybrid isotype, so that useful features of different isotypes may be combined into a single
molecule. Fusion proteins with hybrid isotypes may be modified according to the
invention to reduce immunogenicity.

[0107] An antibody fusion protein with the following components is constructed by
standard recombinant DNA techniques: a light chain and a heavy chain, the V regions
recognizing a tumor-specific antigen, the light chain being a typical light chain, and the
heavy chain comprising CH1, CH2, and CH3 domains from IgG2 and a hinge region
from IgG1, with a cytokine fused to the C-terminus of the heavy chain involving a fusion
junction as described above.

[0108] This protein contains novel junctions between CH1g2 and hinge-g1, and hinge-g1
and CH2g2. The identification and modification of potential T-cell epitopes in these
junctions is performed as follows. For immunocytokines and Fc-X fusion proteins
prepared with either an IgG2 or an IgG2h isotype, these modifications are identical to
those set forth in Examples 1, 2, 3, and 8 above. For X-Fc IgG2h immunofusins, the
novel junction is also identical since the N-terminus of the Fc is located within the hinge
region of the IgG2h protein which has been modified to an IgG1 type. However, there
are two novel fusion junctions in that the IgG1 hinge inserted into an IgG2
immunoglobulin creates two novel junctions between the IgG2 CH1 and IgG1 hinge and
the IgG1 hinge and the IgG2 CH2.

IgG2 CH1 - IgG1 hinge - IgG2 CH2 - IgG2 CH3 - target protein.
[0109] Thus, analysis of the junctions

qtytcnvdhkpsntkvdktv - epkscdkthtcppcp (SEQ ID NO: 49)

between the C-terminus of IgG2 CH1 (bold text) and the N-terminus of the IgG1 hinge
(plain text), and

epkscdkthtcppcp - appvagpsvflfppkpkdtl (SEQ ID NO: 50)

between the C-terminus of the IgG1 hinge (bold text) and the N-terminus of IgG2 CH2
(plain text) by peptide threading should identify peptides with MHC binding potential.
The potentially immunogenic sequences are modified to decrease or eliminate potential T
and B-cell epitopes in order to reduce immunogenicity.

[0110] Two potential T-cell epitopes in the IgG2CH1-IgG1 hinge fusion junction are

TKVDKTVEPKSCD (SEQ ID NO: 51) and KTVEPKSCDKTHT (SEQ ID NO: 52).

[0111] The IgG2CH1-IgG1 hinge fusion junction is de-immunized by changing the V (in
bold) to an A, a T or a P. The sequence of the modified fusion junction is depicted in
SEQ ID NO: 53.

qtytcnvdhkpsntkadkta - epkscdkthtcppcp (SEQ ID NO: 53).

[0112] As stated above, the γ1 hinge sequence in pdCs-huFcγ1 may contain a Cys to Ser
mutation (underlined) that eliminates the Cys residue that forms a disulphide bond with
the light chain in IgG1 (Lo et al., (1998) Protein Engineering 11:495-500), thereby
creating two additional potentially immunogenic fusion junctions for analysis and
modification:

(3) qtytcnvdhkpsntkvdktv - epksSdkthtcppcp (SEQ ID NO: 54)

(4) epksSdkthtcppcp - appvagpsvflfppkpkdtl (SEQ ID NO: 55).

Example 11: Generation of Fc-EPO fusion protein using hybrid isotype Fc components
of IgG1 and IgG4.

[0113] To generate an Fc-erythropoietin fusion protein, the following expression plasmid
was constructed using standard molecular biology techniques. An XmaI-XhoI DNA
fragment containing a form of the human erythropoietin coding sequence with mutations
resulting in the amino acid substitutions His32Gly, Cys33Pro, Trp88Cys, and Pro90Ala,
as disclosed in WO01/36489, was used. The corresponding protein sequence is shown in
SEQ ID NO: 56.

APPRUCOSRVIERYLIJEAKEAEhnTroCAEGPSI^^ 

WQGLAII^EAVLRGQALLVNSSQPGEGUJUIVDKAVSGLRSLT^ 

AAFULinASXrFBKLFKVYS 

[0114] This XmaI-XhoI DNA fragment was inserted into a plasmid vector that encodes a
hinge region from IgG1 and a CH2 and CH3 region from IgG2, except that there were
two sets of mutations that resulted in amino acid substitutions in the region of the CH3
C-terminus, such that the sequence at the junction of the CH3 C-terminus and the Epo
N-terminus is as follows:

.... TQKSATATPGA-APPRLI .... (SEQ ID NO: 57)

[0115] The first set of mutations, which change the sequence KSLSLSPG (SEQ ID NO:
58) of the IgG2 CH3 region to KSATATPG (SEQ ID NO: 59), is disclosed in U.S. Patent
Application Serial No. 60/280,625. The effect of the substitution of Leu-Ser-Leu-Ser
(position 3 to position 6 of SEQ ID NO: 58) with Ala-Thr-Ala-Thr (position 3 to position
6 of SEQ ID NO: 59) is to remove potential human non-self T-cell epitopes that may arise
because the junction between human Fc and human erythropoietin contains non-self
peptide sequences. The second set, consisting of the single amino acid substitution K to A
at the C-terminal amino acid of the CH3 region, is disclosed in U.S. Patent Application
Serial No. 09/780,668.

[0116] The resulting plasmid was transfected into NS/0 cells and the Fc-Epo fusion
protein was expressed and purified according to the procedures known in the art. After
purification based on binding to protein A, the huFcγ2h-huEpo protein containing the
IgG2 CH3 and erythropoietin substitutions described above was characterized by size
exclusion chromatography and found to consist of 97% monomer and 90% monomer in
two independent preparations. The huFcγ2h-huEpo protein containing the IgG2 CH3 and
erythropoietin substitutions described above was found to be about as active, on a molar
basis, as human erythropoietin in a cell-based assay that measured the ability of an
erythropoietin protein to stimulate TF-1 cell division. The assay was performed as
described in WO01/36489.

[0117] In addition, fusions of non-mutant human erythropoietin to the C-terminus of an
Fc region consisting of either IgG1(hinge-CH2-CH3), IgG2(hinge-CH2-CH3), or
IgG1(hinge)-IgG2(CH2-CH3) were characterized. Expression plasmids comprising
non-mutant human Fc sequences and non-mutant erythropoietin sequences were
constructed analogously to the plasmid described above. NS/0 cells were transfected with
the Fcγ1-Epo, Fcγ2-Epo, and Fcγ2h-Epo expression plasmids, and stable clones were
isolated after screening an approximately equal number of clones for each plasmid. The
best-producing clones yielded 50 µg/ml for Fcγ1-Epo, 20 µg/ml for Fcγ2-Epo, and
120 µg/ml for Fcγ2h-Epo.

[0118] The following example describes in detail a preferred method for identification of
immunogenic sequence regions (T-cell epitopes) within the sequences of the fusion
proteins as disclosed in this invention. However, it should be pointed out that said
molecules can be obtained by other known methods.

Example 12. Identification of T-cell epitopes by computational methods
[0119] According to the invention, epitopes in a junction region of a fusion protein can be
modified using methods for introducing mutations into proteins to modulate their
interaction with the immune system. Known methods in the art that can be adapted
according to the invention include those described in the prior art
(WO 92/10755 and WO 96/40792 (Novo Nordisk), EP 0519 596 (Merck & Co.),
EP 0699 755 (Centro de Immunologia Molecular), WO 98/52976 and WO 98/59244
(Biovation Ltd.)) or related methods.

[0120] Advantageous mutant proteins, however, can be obtained if the identification of
said epitopes is realized by the following new method which is described herewith in
detail and applied to the junction region of fusion proteins according to the invention.
[0121] There are a number of factors that play important roles in determining the total
structure of a protein, polypeptide or immunoglobulin. First, the peptide bond, i.e., that
bond which joins the amino acids in the chain together, is a covalent bond. This bond is
planar in structure, essentially a substituted amide. An "amide" is any of a group of
organic compounds containing the grouping -CONH-.

[0122] The planar peptide bond linking Cα of adjacent amino acids may be represented
as depicted below:



[0123] Because the O=C and the C-N atoms lie in a relatively rigid plane, free rotation
does not occur about these axes. Hence, a plane schematically depicted by the interrupted
line is sometimes referred to as an "amide" or "peptide" plane wherein lie the
oxygen (O), carbon (C), nitrogen (N), and hydrogen (H) atoms of the peptide backbone.
At opposite corners of this amide plane are located the Cα atoms. Since there is
substantially no rotation about the O=C and C-N atoms in the peptide or amide plane, a
polypeptide chain thus comprises a series of planar peptide linkages joining the Cα
atoms.

[0124] A second factor that plays an important role in defining the total structure or
conformation of a polypeptide or protein is the angle of rotation of each amide plane
about the common Cα linkage. The terms "angle of rotation" and "torsion angle" are
hereinafter regarded as equivalent terms. Assuming that the O, C, N, and H atoms remain
in the amide plane (which is usually a valid assumption, although there may be some
slight deviations from planarity of these atoms for some conformations), these angles of
rotation define the N and R polypeptide's backbone conformation, i.e., the structure as it
exists between adjacent residues. These two angles are known as φ and ψ. A set of the
angles φi, ψi, where the subscript i represents a particular residue of a polypeptide chain,
thus effectively defines the polypeptide secondary structure. The conventions used in
defining the φ, ψ angles, i.e., the reference points at which the amide planes form a zero
degree angle, and the definition of which angle is φ, and which angle is ψ, for a given
polypeptide, are defined in the literature. See, e.g., Ramachandran et al. Adv. Prot. Chem.
23:283-437 (1968), at pages 285-94, which pages are incorporated herein by reference.
[0125] The present method can be applied to any protein, and is based in part upon the
discovery that in humans the primary Pocket 1 anchor position of MHC Class II molecule
binding grooves has a well designed specificity for particular amino acid side chains. The
specificity of this pocket is determined by the identity of the amino acid at position 86 of
the beta chain of the MHC Class II molecule. This site is located at the bottom of Pocket
1 and determines the size of the side chain that can be accommodated by this pocket.
Marshall, K.W., J. Immunol., 152:4946-4956 (1994). If this residue is a glycine, then all
hydrophobic aliphatic and aromatic amino acids (hydrophobic aliphatics being: valine,
leucine, isoleucine, methionine and aromatics being: phenylalanine, tyrosine and
tryptophan) can be accommodated in the pocket, a preference being for the aromatic side
chains. If this pocket residue is a valine, then the side chain of this amino acid protrudes
into the pocket and restricts the size of peptide side chains that can be accommodated
such that only hydrophobic aliphatic side chains can be accommodated. Therefore, in an
amino acid residue sequence, wherever an amino acid with a hydrophobic aliphatic or
aromatic side chain is found, there is the potential for a MHC Class II restricted T-cell
epitope to be present. If the side-chain is hydrophobic aliphatic, however, it is
approximately twice as likely to be associated with a T-cell epitope than an aromatic side
chain (assuming an approximately even distribution of Pocket 1 types throughout the
global population).
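
By way of illustration only, the Pocket 1 rule stated above, and the reasoning behind the "approximately twice as likely" figure, may be sketched in Python. The function names are hypothetical, and the two pocket types are assumed to be equally frequent, as the paragraph itself assumes.

```python
# Residue groups exactly as listed in the text (one-letter codes).
ALIPHATIC = set("VLIM")  # valine, leucine, isoleucine, methionine
AROMATIC = set("FYW")    # phenylalanine, tyrosine, tryptophan


def pocket1_accepts(beta86: str, residue: str) -> bool:
    """True if `residue` fits Pocket 1 for the given beta-chain residue 86."""
    if beta86 == "G":  # glycine: aliphatic and aromatic side chains both fit
        return residue in ALIPHATIC or residue in AROMATIC
    if beta86 == "V":  # valine: only aliphatic side chains fit
        return residue in ALIPHATIC
    raise ValueError("only glycine (G) and valine (V) are discussed in the text")


def relative_weight(residue: str, pocket_types=("G", "V")) -> int:
    """Count the pocket types (assumed equally frequent) that accept `residue`."""
    return sum(pocket1_accepts(p, residue) for p in pocket_types)
```

Under this assumption an aliphatic residue such as leucine is accepted by both pocket types and an aromatic residue such as phenylalanine by only one, which gives the 2:1 ratio.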

[0126] A computational method embodying the present invention profiles the likelihood
of peptide regions to contain T-cell epitopes as follows: (1) The primary sequence of a
peptide segment of predetermined length is scanned, and all hydrophobic aliphatic and
aromatic side chains present are identified. (2) The hydrophobic aliphatic side chains are
assigned a value greater than that for the aromatic side chains; preferably about twice the
value assigned to the aromatic side chains, e.g., a value of 2 for a hydrophobic aliphatic
side chain and a value of 1 for an aromatic side chain. (3) The values determined to be
present are summed for each overlapping amino acid residue segment (window) of
predetermined uniform length within the peptide, and the total value for a particular
segment (window) is assigned to a single amino acid residue at an intermediate position
of the segment (window), preferably to a residue at about the midpoint of the sampled
segment (window). This procedure is repeated for each sampled overlapping amino acid
residue segment (window). Thus, each amino acid residue of the peptide is assigned a
value that relates to the likelihood of a T-cell epitope being present in that particular
segment (window). (4) The values calculated and assigned as described in Step 3, above,
can be plotted against the amino acid coordinates of the entire amino acid residue
sequence being assessed. (5) All portions of the sequence which have a score of a
predetermined value, e.g., a value of 1, are deemed likely to contain a T-cell epitope and
can be modified, if desired.
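
Steps (1) to (5) above may be sketched, purely as an illustration, in Python. The function names, the default window length and the reading of "a score of a predetermined value" as "at least that value" are assumptions that do not appear in the text. Residues too close to either terminus to sit at the midpoint of a full window are left unscored.

```python
ALIPHATIC = set("VLIM")  # value 2 per the text
AROMATIC = set("FYW")    # value 1 per the text


def residue_value(aa: str) -> int:
    """Step 2: value assigned to a single residue."""
    if aa in ALIPHATIC:
        return 2
    if aa in AROMATIC:
        return 1
    return 0


def epitope_profile(sequence: str, window: int = 9):
    """Step 3: sum values over each overlapping window and assign the
    total to the residue at about the midpoint of that window."""
    values = [residue_value(aa) for aa in sequence]
    profile = [None] * len(sequence)  # None = no full window centred here
    for start in range(len(sequence) - window + 1):
        profile[start + window // 2] = sum(values[start:start + window])
    return profile


def flagged_positions(profile, threshold: int = 1):
    """Step 5: positions whose score reaches the (assumed) threshold."""
    return [i for i, s in enumerate(profile) if s is not None and s >= threshold]
```

With an (assumed) window of 8, the unmodified junction segment KSLSLSPG scores 4, while the substituted segment KSATATPG scores 0.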

[0127] This particular aspect of the present invention provides a general method by
which the regions of peptides likely to contain T-cell epitopes can be described.
Modifications to the peptide in these regions have the potential to modify the MHC Class
II binding characteristics.

[0128] According to another aspect of the present invention, T-cell epitopes can be
predicted with greater accuracy by the use of a more sophisticated computational method
which takes into account the interactions of peptides with models of MHC Class II
alleles.

[0129] The computational prediction of T-cell epitopes present within a peptide
according to this particular aspect contemplates the construction of models of at least 42
MHC Class II alleles based upon the structures of all known MHC Class II molecules and
a method for the use of these models in the computational identification of T-cell
epitopes, the construction of libraries of peptide backbones for each model in order to
allow for the known variability in relative peptide backbone alpha carbon (Cα) positions,
the construction of libraries of amino-acid side chain conformations for each backbone
dock with each model for each of the 20 amino-acid alternatives at positions critical for
the interaction between peptide and MHC Class II molecule, and the use of these libraries
of backbones and side-chain conformations in conjunction with a scoring function to
select the optimum backbone and side-chain conformation for a particular peptide docked
with a particular MHC Class II molecule and the derivation of a binding score from this
interaction.

[0130] Models of MHC Class II molecules can be derived via homology modeling from
a number of similar structures found in the Brookhaven Protein Data Bank ("PDB").
These may be made by the use of semi-automatic homology modeling software
(Modeller, Sali A. & Blundell T.L., 1993, J. Mol. Biol. 234:779-815) which incorporates a
simulated annealing function, in conjunction with the CHARMm force-field for energy
minimization (available from Molecular Simulations Inc., San Diego, CA). Alternative
modeling methods can be utilized as well.

[0131] The present method differs significantly from other computational methods
which use libraries of experimentally derived binding data of each amino-acid alternative
at each position in the binding groove for a small set of MHC Class II molecules
(Marshall, K.W., et al., Biomed. Pept. Proteins Nucleic Acids, 1(3):157-162 (1995)) or
yet other computational methods which use similar experimental binding data in order to
define the binding characteristics of particular types of binding pockets within the groove,
again using a relatively small subset of MHC Class II molecules, and then 'mixing and
matching' pocket types from this pocket library to artificially create further 'virtual'
MHC Class II molecules (Sturniolo T., et al., Nat. Biotech. 17(6):555-561 (1999)). Both
prior methods suffer the major disadvantage that, due to the complexity of the assays and
the need to synthesize large numbers of peptide variants, only a small number of MHC
Class II molecules can be experimentally scanned. Therefore the first prior method can
only make predictions for a small number of MHC Class II molecules. The second prior
method also makes the assumption that a pocket lined with similar amino-acids in one
molecule will have the same binding characteristics when in the context of a different
Class II allele and suffers further disadvantages in that only those MHC Class II
molecules can be 'virtually' created which contain pockets contained within the pocket
library. Using the modeling approach described herein, the structure of any number and
type of MHC Class II molecules can be deduced, therefore alleles can be specifically
selected to be representative of the global population. In addition, the number of MHC
Class II molecules scanned can be increased by making further models rather than
having to generate additional data via complex experimentation.
[0132] The use of a backbone library allows for variation in the positions of the Cα
atoms of the various peptides being scanned when docked with particular MHC Class II
molecules. This is again in contrast to the alternative prior computational methods
described above which rely on the use of simplified peptide backbones for scanning
amino-acid binding in particular pockets. These simplified backbones are not likely to be
representative of backbone conformations found in 'real' peptides leading to inaccuracies
in prediction of peptide binding. The present backbone library is created by superposing
the backbones of all peptides bound to MHC Class II molecules found within the Protein
Data Bank and noting the root mean square (RMS) deviation between the Cα atoms of
each of the eleven amino-acids located within the binding groove. While this library can
be derived from a small number of suitable available mouse and human structures
(currently 13), in order to allow for the possibility of even greater variability, the RMS
figure for each Cα position is increased by 50%. The average Cα position of each
amino-acid is then determined and a sphere drawn around this point whose radius equals
the RMS deviation at that position plus 50%. This sphere represents all allowed Cα
positions.
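
The construction of a "sphere of allowed positions" for one groove position may be sketched as follows. This is a minimal illustration only: the function names are hypothetical, the input is assumed to be the already superposed Cα coordinates for a single position, and the RMS deviation is assumed to be taken about the average position.

```python
import math


def allowed_sphere(ca_positions, margin=0.5):
    """Return (centre, radius) for one groove position.

    ca_positions: (x, y, z) coordinates of the same C-alpha position taken
    from each superposed peptide backbone.
    margin: fractional enlargement of the RMS deviation (50% in the text).
    """
    n = len(ca_positions)
    # Average C-alpha position over all superposed structures.
    centre = tuple(sum(p[k] for p in ca_positions) / n for k in range(3))
    # RMS deviation of the observed positions about that average (assumption).
    rms = math.sqrt(sum(math.dist(p, centre) ** 2 for p in ca_positions) / n)
    return centre, rms * (1.0 + margin)


def in_sphere(point, centre, radius):
    """True if a candidate C-alpha position lies within the allowed sphere."""
    return math.dist(point, centre) <= radius
```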

[0133] Working from the Cα with the least RMS deviation (that of the amino-acid in
Pocket 1 as mentioned above, equivalent to Position 2 of the 11 residues in the binding
groove), the sphere is three-dimensionally gridded, and each vertex within the grid is then
used as a possible location for a Cα of that amino-acid. The subsequent amide plane,
corresponding to the peptide bond to the subsequent amino-acid, is grafted onto each of
these Cαs and the φ and ψ angles are rotated step-wise at set intervals in order to position
the subsequent Cα. If the subsequent Cα falls within the 'sphere of allowed positions'
for this Cα then the orientation of the dipeptide is accepted, whereas if it falls outside the
sphere then the dipeptide is rejected. This process is then repeated for each of the
subsequent Cα positions, such that the peptide grows from the Pocket 1 Cα 'seed', until
all nine subsequent Cαs have been positioned from all possible permutations of the
preceding Cαs. The process is then repeated once more for the single Cα preceding
pocket 1 to create a library of backbone Cα positions located within the binding groove.
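
The gridding of the seed sphere and the accept/reject growth of backbones may be sketched as below. This is a simplified illustration, not the disclosed implementation: the names are hypothetical, and the step-wise φ/ψ rotation of the grafted amide plane is replaced by a caller-supplied `propose_next` function that yields candidate positions for the next Cα.

```python
import math


def grid_sphere(centre, radius, step):
    """Vertices of a cubic grid (spacing `step`) lying inside the sphere."""
    n = math.floor(radius / step)
    points = []
    for i in range(-n, n + 1):
        for j in range(-n, n + 1):
            for k in range(-n, n + 1):
                p = (centre[0] + i * step, centre[1] + j * step, centre[2] + k * step)
                if math.dist(p, centre) <= radius:
                    points.append(p)
    return points


def grow_backbones(seeds, spheres, propose_next):
    """Extend every chain one C-alpha at a time.

    seeds: candidate positions for the seed C-alpha (e.g. from grid_sphere).
    spheres: (centre, radius) of the allowed sphere for each subsequent C-alpha.
    propose_next: hypothetical stand-in for the phi/psi rotation; given the
    chain so far, it yields candidate positions for the next C-alpha.
    """
    chains = [[s] for s in seeds]
    for centre, radius in spheres:
        extended = []
        for chain in chains:
            for candidate in propose_next(chain):
                # Accept only if the new C-alpha falls within its allowed sphere.
                if math.dist(candidate, centre) <= radius:
                    extended.append(chain + [candidate])
        chains = extended
    return chains
```

A finer grid or a finer set of candidates from `propose_next` yields more accepted chains, which corresponds to the dependence of library size on grid and rotation fineness.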
[0134] The number of backbones generated is dependent upon several factors: the size
of the 'spheres of allowed positions'; the fineness of the gridding of the 'primary sphere'
at the Pocket 1 position; the fineness of the step-wise rotation of the φ and ψ angles used
to position subsequent Cαs. Using this process, a large library of backbones can be
created. The larger the backbone library, the more likely it will be that the optimum fit
will be found for a particular peptide within the binding groove of an MHC Class II
molecule. Inasmuch as all backbones will not be suitable for docking with all the models
of MHC Class II molecules due to clashes with amino-acids of the binding domains, for
each allele a subset of the library is created comprising backbones which can be
accommodated by that allele. The use of the backbone library, in conjunction with the
models of MHC Class II molecules, creates an exhaustive database consisting of allowed
side chain conformations for each amino-acid in each position of the binding groove for
each MHC Class II molecule docked with each allowed backbone. This data set is
generated using a simple steric overlap function where a MHC Class II molecule is
docked with a backbone and an amino-acid side chain is grafted onto the backbone at the
desired position. Each of the rotatable bonds of the side chain is rotated step-wise at set
intervals and the resultant positions of the atoms dependent upon that bond noted. The
interaction of the atom with atoms of side-chains of the binding groove is noted and
positions are either accepted or rejected according to the following criteria: The sum total
of the overlap of all atoms so far positioned must not exceed a pre-determined value.
Thus the stringency of the conformational search is a function of the interval used in the
step-wise rotation of the bond and the pre-determined limit for the total overlap. This
latter value can be small if it is known that a particular pocket is rigid, however the
stringency can be relaxed if the positions of pocket side-chains are known to be relatively
flexible. Thus allowances can be made to imitate variations in flexibility within pockets
of the binding groove. This conformational search is then repeated for every amino-acid
at every position of each backbone when docked with each of the MHC Class II
molecules to create the exhaustive database of side-chain conformations.
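
The acceptance criterion for a side-chain conformation may be sketched as follows. The text does not define how the overlap of two atoms is measured; this illustration assumes it is the amount by which the sum of the two atomic radii exceeds the interatomic distance. All names are hypothetical.

```python
import math


def pair_overlap(pos1, radius1, pos2, radius2):
    """Assumed overlap measure: sum of radii minus distance, floored at zero."""
    return max(0.0, radius1 + radius2 - math.dist(pos1, pos2))


def conformation_allowed(side_chain_atoms, groove_atoms, max_total_overlap):
    """Accept a conformation only while the running total overlap of all
    atoms positioned so far stays within the pre-determined limit.

    Both atom lists hold ((x, y, z), radius) pairs.
    """
    total = 0.0
    for pos, radius in side_chain_atoms:
        for groove_pos, groove_radius in groove_atoms:
            total += pair_overlap(pos, radius, groove_pos, groove_radius)
            if total > max_total_overlap:
                return False  # reject as soon as the limit is exceeded
    return True
```

A small `max_total_overlap` models a rigid pocket; a larger value relaxes the stringency for pockets whose side chains are known to be flexible.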
[0135] A suitable mathematical expression is used to estimate the energy of binding
between models of MHC Class II molecules in conjunction with peptide ligand
conformations which have to be empirically derived by scanning the large database of
backbone/side-chain conformations described above. Thus a protein is scanned for
potential T-cell epitopes by subjecting each possible peptide of length varying between 9
and 20 amino-acids (although the length is kept constant for each scan) to the following
computations: An MHC Class II molecule is selected together with a peptide backbone
allowed for that molecule and the side-chains corresponding to the desired peptide
sequence are grafted on. Atom identity and interatomic distance data relating to a
particular side-chain at a particular position on the backbone are collected for each
allowed conformation of that amino-acid (obtained from the database described above).
This is repeated for each side-chain along the backbone and peptide scores derived using
a scoring function. The best score for that backbone is retained and the process repeated
for each allowed backbone for the selected model. The scores from all allowed
backbones are compared and the highest score is deemed to be the peptide score for the
desired peptide in that MHC Class II model. This process is then repeated for each model
with every possible peptide derived from the protein being scanned, and the scores for
peptides versus models are displayed.
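
The outer scanning loop described above may be sketched as follows. The scoring function itself is passed in as a caller-supplied callable, since this sketch does not attempt to reproduce it; the names `peptides`, `scan_protein` and `score` are hypothetical.

```python
def peptides(protein: str, length: int):
    """Every overlapping peptide of the chosen fixed length."""
    return [protein[i:i + length] for i in range(len(protein) - length + 1)]


def scan_protein(protein, length, models, score):
    """Peptide score for every (peptide, model) pair.

    models: dict mapping a model name to its list of allowed backbones.
    score: hypothetical callable score(peptide, model, backbone) -> float.
    The highest score over all allowed backbones is kept for each pair.
    """
    results = {}
    for start, peptide in enumerate(peptides(protein, length)):
        for model, backbones in models.items():
            results[(start, peptide, model)] = max(
                score(peptide, model, backbone) for backbone in backbones
            )
    return results
```

The returned dictionary is the "peptides versus models" table; the length is held constant within one scan, and further scans can be run with other lengths between 9 and 20.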

[0136] In the context of the present invention, each ligand presented for the binding
affinity calculation is an amino-acid segment selected from a peptide or protein as
discussed above. Thus, the ligand is a selected stretch of amino acids about 9 to 20 amino
acids in length derived from a peptide, polypeptide or protein of known sequence. The
terms "amino acids" and "residues" are hereinafter regarded as equivalent terms. The
ligand, in the form of the consecutive amino acids of the peptide to be examined grafted
onto a backbone from the backbone library, is positioned in the binding cleft of an MHC
Class II molecule from the MHC Class II molecule model library via the coordinates of
the Cα atoms of the peptide backbone and an allowed conformation for each side-chain
is selected from the database of allowed conformations. The relevant atom identities and
interatomic distances are also retrieved from this database and used to calculate the
peptide binding score. Ligands with a high binding affinity for the MHC Class II binding
pocket are flagged as candidates for site-directed mutagenesis. Amino-acid substitutions
are made in the flagged ligand (and hence in the protein of interest) which is then retested
using the scoring function in order to determine changes which reduce the binding
affinity below a predetermined threshold value. These changes can then be incorporated
into the protein of interest to remove T-cell epitopes.
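
The substitute-and-retest step may be sketched as below. The sketch is restricted to single substitutions for brevity, which is a simplification not stated in the text, and `score` is again a hypothetical caller-supplied scoring function.

```python
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard residues


def deimmunizing_substitutions(peptide, score, threshold):
    """List single substitutions that bring the score below the threshold.

    score: hypothetical callable score(peptide) -> float.
    Returns (position, original, replacement, new_score) tuples.
    """
    hits = []
    for pos, original in enumerate(peptide):
        for aa in AMINO_ACIDS:
            if aa == original:
                continue
            variant = peptide[:pos] + aa + peptide[pos + 1:]
            new_score = score(variant)  # retest the modified ligand
            if new_score < threshold:
                hits.append((pos, original, aa, new_score))
    return hits
```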

[0137] Binding between the peptide ligand and the binding groove of MHC Class II
molecules involves non-covalent interactions including, but not limited to: hydrogen
bonds, electrostatic interactions, hydrophobic (lipophilic) interactions and Van der Waal's
interactions. These are included in the peptide scoring function as described in detail
below. It should be understood that a hydrogen bond is a non-covalent bond which can be
formed between polar or charged groups and consists of a hydrogen atom shared by two
other atoms. The hydrogen of the hydrogen donor has a positive charge whereas the
hydrogen acceptor has a partial negative charge. For the purposes of peptide/protein
interactions, hydrogen bond donors may be either nitrogens with hydrogen attached or
hydrogens attached to oxygen or nitrogen. Hydrogen bond acceptor atoms may be
oxygens not attached to hydrogen, nitrogens with no hydrogens attached and one or two
connections, or sulphurs with only one connection. Certain atoms, such as oxygens
attached to hydrogens or imine nitrogens (e.g. C=NH) may be both hydrogen acceptors or
donors. Hydrogen bond energies range from 3 to 7 Kcal/mol and are much stronger than
Van der Waal's bonds, but weaker than covalent bonds. Hydrogen bonds are also highly
directional and are at their strongest when the donor atom, hydrogen atom and acceptor
atom are co-linear. Electrostatic bonds are formed between oppositely charged ion pairs
and the strength of the interaction is inversely proportional to the square of the distance
between the atoms according to Coulomb's law. The optimal distance between ion pairs
is about 2.8 Å. In protein/peptide interactions, electrostatic bonds may be formed between
arginine, histidine or lysine and aspartate or glutamate. The strength of the bond will
depend upon the pKa of the ionizing group and the dielectric constant of the medium
although they are approximately similar in strength to hydrogen bonds.
[0138] Lipophilic interactions are favorable hydrophobic-hydrophobic contacts that
occur between the protein and peptide ligand. Usually, these will occur between
hydrophobic amino acid side chains of the peptide buried within the pockets of the
binding groove such that they are not exposed to solvent. Exposure of the hydrophobic
residues to solvent is highly unfavorable since the surrounding solvent molecules are
forced to hydrogen bond with each other forming cage-like clathrate structures. The
resultant decrease in entropy is highly unfavorable. Lipophilic atoms may be sulphurs
which are neither polar nor hydrogen acceptors and carbon atoms which are not polar.
[0139] Van der Waal's bonds are non-specific forces found between atoms which are
3-4 Å apart. They are weaker and less specific than hydrogen and electrostatic bonds. The
distribution of electronic charge around an atom changes with time and, at any instant, the
charge distribution is not symmetric. This transient asymmetry in electronic charge
induces a similar asymmetry in neighboring atoms. The resultant attractive forces
between atoms reach a maximum at the Van der Waal's contact distance but diminish
very rapidly at about 1 Å to about 2 Å. Conversely, as atoms become separated by less
than the contact distance, increasingly strong repulsive forces become dominant as the
outer electron clouds of the atoms overlap. Although the attractive forces are relatively
weak compared to electrostatic and hydrogen bonds (about 0.6 Kcal/mol), the repulsive
forces in particular may be very important in determining whether a peptide ligand may
bind successfully to a protein.

[0140] In one embodiment, the Böhm scoring function (SCORE1 approach) is used to
estimate the binding constant (Böhm, H.J., J. Comput. Aided Mol. Des., 8(3):243-256
(1994), which is hereby incorporated in its entirety). In another embodiment, the scoring
function (SCORE2 approach) is used to estimate the binding affinities as an indicator of a
ligand containing a T-cell epitope (Böhm, H.J., J. Comput. Aided Mol. Des.,
12(4):309-323 (1998), which is hereby incorporated in its entirety). However, the Böhm scoring
functions as described in the above references are used to estimate the binding affinity of
a ligand to a protein where it is already known that the ligand successfully binds to the
protein and the protein/ligand complex has had its structure solved, the solved structure
being present in the Protein Data Bank ("PDB"). Therefore, the scoring function has
been developed with the benefit of known positive binding data. In order to allow for
discrimination between positive and negative binders, a repulsion term must be added to
the equation. In addition, a more satisfactory estimate of binding energy is achieved by
computing the lipophilic interactions in a pairwise manner rather than using the area
based energy term of the above Böhm functions. Therefore, in a preferred embodiment,
the binding energy is estimated using a modified Böhm scoring function. In the modified
Böhm scoring function, the binding energy between protein and ligand ($\Delta G_{bind}$) is
estimated considering the following parameters: The reduction of binding energy due to
the overall loss of translational and rotational entropy of the ligand ($\Delta G_{0}$); contributions
from ideal hydrogen bonds ($\Delta G_{hb}$) where at least one partner is neutral; contributions
from unperturbed ionic interactions ($\Delta G_{ionic}$); lipophilic interactions between lipophilic
ligand atoms and lipophilic acceptor atoms ($\Delta G_{lipo}$); the loss of binding energy due to the
freezing of internal degrees of freedom in the ligand, i.e., the freedom of rotation about
each C-C bond is reduced ($\Delta G_{rot}$); the energy of the interaction between the protein and
ligand ($E_{VdW}$). Consideration of these terms gives equation 1:

$$\Delta G_{bind} = (\Delta G_{0}) + (\Delta G_{hb} \times N_{hb}) + (\Delta G_{ionic} \times N_{ionic}) + (\Delta G_{lipo} \times N_{lipo}) + (\Delta G_{rot} + N_{rot}) + (E_{VdW})$$

Where N is the number of qualifying interactions for a specific term and, in one
embodiment, $\Delta G_{0}$, $\Delta G_{hb}$, $\Delta G_{ionic}$, $\Delta G_{lipo}$ and $\Delta G_{rot}$ are constants which are given the
values: 5.4, -4.7, -4.7, -0.17, and 1.4, respectively.
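
Equation 1 may be sketched in Python as follows, for illustration only. Two assumptions are made and should be checked against the original filing: the second constant is read as -4.7 from the damaged print, and the rotational term is evaluated as a product of the constant and the count, consistent with the other terms and with Böhm (1994), although Equation 1 as printed shows a sum for that term. The function and constant names are hypothetical.

```python
# Constants for one embodiment, in the order given in the text.
DG_0 = 5.4       # loss of translational and rotational entropy
DG_HB = -4.7     # per ideal hydrogen bond (value assumed from damaged print)
DG_IONIC = -4.7  # per unperturbed ionic interaction
DG_LIPO = -0.17  # per lipophilic contact
DG_ROT = 1.4     # per rotatable bond frozen on binding


def binding_energy(n_hb, n_ionic, n_lipo, n_rot, e_vdw):
    """Estimate the binding energy from the interaction counts and E_VdW.

    The rotational term is taken as DG_ROT * n_rot (assumption; the printed
    equation shows DG_ROT + N_rot).
    """
    return (
        DG_0
        + DG_HB * n_hb
        + DG_IONIC * n_ionic
        + DG_LIPO * n_lipo
        + DG_ROT * n_rot
        + e_vdw
    )
```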

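The term $N_{hb}$ is calculated according to equation 2:

$$N_{hb} = \sum_{h\text{-}bonds} f(\Delta R, \Delta\alpha) \times f(N_{neighb}) \times f_{pcs}$$

$f(\Delta R, \Delta\alpha)$ is a penalty function which accounts for large deviations of hydrogen
bonds from ideality and is calculated according to equation 3:

$$f(\Delta R, \Delta\alpha) = f_1(\Delta R) \times f_2(\Delta\alpha)$$

Where:

$$f_1(\Delta R) = \begin{cases} 1 & \text{if } \Delta R \le TOL \\ 1 - (\Delta R - TOL)/0.4 & \text{if } \Delta R \le 0.4 + TOL \\ 0 & \text{if } \Delta R > 0.4 + TOL \end{cases}$$

And:

$$f_2(\Delta\alpha) = \begin{cases} 1 & \text{if } \Delta\alpha < 30^{\circ} \\ 1 - (\Delta\alpha - 30)/50 & \text{if } \Delta\alpha \le 80^{\circ} \\ 0 & \text{if } \Delta\alpha > 80^{\circ} \end{cases}$$

TOL is the tolerated deviation in hydrogen bond length = 0.25 Å
$\Delta R$ is the deviation of the H-O/N hydrogen bond length from the ideal value = 1.9 Å
$\Delta\alpha$ is the deviation of the hydrogen bond angle $\angle_{N/O-H \cdots O/N}$ from its idealized value of
180°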
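
The hydrogen bond penalty function of equation 3 may be sketched as follows. The function names are hypothetical, and the deviations are assumed to be taken as absolute values, which the text does not state explicitly.

```python
TOL = 0.25            # tolerated deviation in hydrogen bond length (angstrom)
IDEAL_LENGTH = 1.9    # ideal H...O/N hydrogen bond length (angstrom)
IDEAL_ANGLE = 180.0   # idealized hydrogen bond angle (degrees)


def f1(delta_r):
    """Length penalty: 1 within tolerance, falling linearly to 0 over 0.4 angstrom."""
    if delta_r <= TOL:
        return 1.0
    if delta_r <= 0.4 + TOL:
        return 1.0 - (delta_r - TOL) / 0.4
    return 0.0


def f2(delta_alpha):
    """Angle penalty: 1 below 30 degrees, falling linearly to 0 at 80 degrees."""
    if delta_alpha < 30:
        return 1.0
    if delta_alpha <= 80:
        return 1.0 - (delta_alpha - 30) / 50
    return 0.0


def hbond_penalty(length, angle):
    """f(delta R, delta alpha) for one hydrogen bond (absolute deviations assumed)."""
    return f1(abs(length - IDEAL_LENGTH)) * f2(abs(angle - IDEAL_ANGLE))
```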

$f(N_{neighb})$ distinguishes between concave and convex parts of a protein surface and
therefore assigns greater weight to polar interactions found in pockets rather than
those found at the protein surface. This function is calculated according to equation 4
below:

$$f(N_{neighb}) = (N_{neighb}/N_{neighb,0})^{\alpha} \quad \text{where } \alpha = 0.5$$

$N_{neighb}$ is the number of non-hydrogen protein atoms that are closer than 5 Å to any
given protein atom.
$N_{neighb,0}$ is a constant = 25
$f_{pcs}$ is a function which allows for the polar contact surface area per hydrogen bond
and therefore distinguishes between strong and weak hydrogen bonds and its value is
determined according to the following criteria:
$f_{pcs} = \beta$ when $A_{polar}/N_{HB} < 10$ Å²
or $f_{pcs} = 1$ when $A_{polar}/N_{HB} > 10$ Å²
$A_{polar}$ is the size of the polar protein-ligand contact surface
$N_{HB}$ is the number of hydrogen bonds
$\beta$ is a constant whose value = 1.2
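
The two weighting factors defined above may be sketched as follows. The function names are hypothetical. The text leaves the case in which the polar contact area per hydrogen bond equals exactly 10 Å² undefined; this sketch arbitrarily returns 1 for that case.

```python
N_NEIGHB_0 = 25  # reference neighbour count
ALPHA = 0.5      # exponent of equation 4
BETA = 1.2       # weight for a small polar contact area per hydrogen bond


def f_neighb(n_neighb):
    """Equation 4: greater weight for buried (pocket) polar interactions."""
    return (n_neighb / N_NEIGHB_0) ** ALPHA


def f_pcs(a_polar, n_hb):
    """Polar contact surface factor per hydrogen bond.

    a_polar: polar protein-ligand contact surface (square angstrom).
    n_hb: number of hydrogen bonds (must be non-zero).
    """
    return BETA if a_polar / n_hb < 10 else 1.0
```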

For the implementation of the modified Böhm scoring function, the contributions
from ionic interactions, $\Delta G_{ionic}$, are computed in a similar fashion to those from
hydrogen bonds described above since the same geometry dependency is assumed.
The term $N_{lipo}$ is calculated according to equation 5 below:

$$N_{lipo} = \sum_{lL} f(r_{lL})$$

$f(r_{lL})$ is calculated for all lipophilic ligand atoms, l, and all lipophilic protein atoms, L,
according to the following criteria:

$f(r_{lL}) = 1$ when $r_{lL} \le R1$
$f(r_{lL}) = (r_{lL} - R1)/(R2 - R1)$ when $R1 < r_{lL} < R2$
$f(r_{lL}) = 0$ when $r_{lL} \ge R2$
Where: $R1 = r_{l}^{vdw} + r_{L}^{vdw} + 0.5$
and $R2 = R1 + 3.0$

and $r_{l}^{vdw}$ is the Van der Waal's radius of atom l
and $r_{L}^{vdw}$ is the Van der Waal's radius of atom L
The term $N_{rot}$ is the number of rotatable bonds of the amino acid side chain and is taken
to be the number of acyclic sp3-sp3 and sp3-sp2 bonds. Rotations of terminal -CH3
or -NH3 are not taken into account.

The final term, $E_{VdW}$, is calculated according to equation 6 below:

$$E_{VdW} = \varepsilon_1 \varepsilon_2 \left( \frac{(r_1^{vdw} + r_2^{vdw})^{12}}{r^{12}} - \frac{(r_1^{vdw} + r_2^{vdw})^{6}}{r^{6}} \right)$$

where:
$\varepsilon_1$ and $\varepsilon_2$ are constants dependent upon atom identity
$r_1^{vdw} + r_2^{vdw}$ are the Van der Waal's atomic radii

r is the distance between a pair of atoms.
[0141] With regard to Equation 6, in one embodiment, the constants $\varepsilon_1$ and $\varepsilon_2$ are given
the atom values: C: 0.245, N: 0.283, O: 0.316, S: 0.316, respectively (i.e. for atoms of
Carbon, Nitrogen, Oxygen and Sulphur, respectively). With regard to equations 5 and 6,
the Van der Waal's radii are given the atom values C: 1.85, N: 1.75, O: 1.60, S: 2.00 Å.
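
Equation 6, read as a 12-6 (Lennard-Jones type) expression with the exponents 12 and 6 recovered from the damaged print, may be sketched for a single atom pair as follows. The function name is hypothetical; the constants are those stated in the preceding paragraph.

```python
# Per-atom constants stated for equation 6.
EPSILON = {"C": 0.245, "N": 0.283, "O": 0.316, "S": 0.316}
VDW_RADIUS = {"C": 1.85, "N": 1.75, "O": 1.60, "S": 2.00}  # angstrom


def e_vdw_pair(atom1, atom2, r):
    """Van der Waal's term for one pair of atoms separated by distance r."""
    contact = VDW_RADIUS[atom1] + VDW_RADIUS[atom2]
    ratio6 = (contact / r) ** 6
    # (contact/r)^12 - (contact/r)^6, scaled by the two atom constants.
    return EPSILON[atom1] * EPSILON[atom2] * (ratio6 ** 2 - ratio6)
```

In this form the term is zero when r equals the sum of the two radii, positive (repulsive) at shorter distances and negative (attractive) at longer ones, which provides the repulsion needed to discriminate non-binders.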
[0142] It should be understood that all predetermined values and constants given in the
equations above are determined within the constraints of current understandings of
protein ligand interactions with particular regard to the type of computation being
undertaken herein. Therefore, it is possible that, as this scoring function is refined
further, these values and constants may change, hence any suitable numerical value which
gives the desired results in terms of estimating the binding energy of a protein to a ligand
may be used and hence fall within the scope of the present invention.
[0143] As described above, the scoring function is applied to data extracted from the
database of side-chain conformations, atom identities, and interatomic distances. For the
purposes of the present description, the number of MHC Class II molecules included in
this database is 42 models plus four solved structures. It should be apparent from the
above descriptions that the modular nature of the construction of the computational
method of the present invention means that new models can simply be added and scanned
with the peptide backbone library and side-chain conformational search function to create
additional data sets which can be processed by the peptide scoring function as described
above. This allows for the repertoire of scanned MHC Class II molecules to easily be
increased, or structures and associated data to be replaced if data are available to create
more accurate models of the existing alleles.

[0144] The present prediction method can be calibrated against a data set comprising a
large number of peptides whose affinity for various MHC Class II molecules has
previously been experimentally determined. By comparison of calculated versus
experimental data, a cut-off value can be determined above which it is known that all
experimentally determined T-cell epitopes are correctly predicted.
[0145] It should be understood that, although the above scoring function is relatively 
simple compared to some sophisticated methodologies that are available, the calculations 
are performed extremely rapidly. It should also be understood that the objective is not to 
calculate the true binding energy per se for each peptide docked in the binding groove of 
a selected MHC Class II protein. The underlying objective is to obtain comparative 
binding energy data as an aid to predicting the location of T-cell epitopes based on the 
primary structure (i.e. amino acid sequence) of a selected protein. A relatively high 
binding energy or a binding energy above a selected threshold value would suggest the 
presence of a T-cell epitope in the ligand. The ligand may then be subjected to at least 
one round of amino-acid substitution and the binding energy recalculated. Due to the 
rapid nature of the calculations, these manipulations of the peptide sequence can be 
performed interactively within the program's user interface on cost-effectively available 
computer hardware. Major investment in computer hardware is thus not required. 
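
The substitute-and-rescore cycle described above can be sketched as follows. This is an illustration under stated assumptions, not the disclosed program: `toy_score` is a hypothetical stand-in that merely counts the hydrophobic residues named in the claims (Leu, Val, Ile, Met, Phe, Trp, Tyr), and the replacement residues Ala and Thr are chosen only because the sequence listing shows LSLS changed to ATAT. Any real comparative binding-energy function could be passed in its place.

```python
# Residues named in the claims as candidates for substitution.
HYDROPHOBIC = set("LVIMFWY")


def toy_score(peptide: str) -> int:
    """Hypothetical stand-in for the scoring function: counts hydrophobic residues."""
    return sum(1 for aa in peptide if aa in HYDROPHOBIC)


def best_single_substitution(peptide, score_fn, replacements="AT"):
    """Return (score, sequence) of the best single-residue variant, or the input if none is better."""
    best = (score_fn(peptide), peptide)
    for i, old in enumerate(peptide):
        for new in replacements:
            if new == old:
                continue
            variant = peptide[:i] + new + peptide[i + 1:]
            score = score_fn(variant)
            if score < best[0]:  # strict '<' keeps the first variant found at the lowest score
                best = (score, variant)
    return best


def reduce_binding(peptide, score_fn, threshold, max_rounds=5):
    """Apply rounds of substitution and rescoring until the score drops below the threshold."""
    score = score_fn(peptide)
    for _ in range(max_rounds):
        if score < threshold:
            break
        new_score, new_peptide = best_single_substitution(peptide, score_fn)
        if new_score >= score:  # no improving substitution exists
            break
        score, peptide = new_score, new_peptide
    return peptide, score
```

Because each round rescores only single-residue variants of a short peptide, the loop stays cheap enough to run interactively, which is the property the paragraph above emphasizes.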
[0146] It would be apparent to one skilled in the art that other available software could be 
used for the same purposes. In particular, more sophisticated software which is capable 
of docking ligands into protein binding-sites may be used in conjunction with energy 
minimization. Examples of docking software are: DOCK (Kuntz et al., J. Mol. Biol., 
161:269-288 (1982)), LUDI (Böhm, H.J., J. Comput. Aided Mol. Des., 8:623-632 (1994)) 
and FLEXX (Rarey M., et al., ISMB, 3:300-308 (1995)). Examples of molecular 
modeling and manipulation software include: AMBER (Tripos) and CHARMm 
(Molecular Simulations Inc.). The use of these computational methods would severely 
limit the throughput of the method of this invention due to the lengths of processing time 
required to make the necessary calculations. However, it is feasible that such methods 
could be used as a 'secondary screen' to obtain more accurate calculations of binding 
energy for peptides which are found to be 'positive binders' via the method of the present 
invention. The limitation of processing time for sophisticated molecular mechanic or 
molecular dynamic calculations is one which is defined both by the design of the software 
which makes these calculations and the current technology limitations of computer 
hardware. It may be anticipated that, in the future, with the writing of more efficient code 
and the continuing increases in speed of computer processors, it may become feasible to 
make such calculations within a more manageable time-frame. Further information on 
energy functions applied to macromolecules and consideration of the various interactions 
that take place within a folded protein structure can be found in: Brooks, B.R., et al., J. 
Comput. Chem., 4:187-217 (1983) and further information concerning general protein- 
ligand interactions can be found in: Dauber-Osguthorpe et al., Proteins 4(1):31-47 (1988), 
which are incorporated herein by reference in their entirety. Useful background 
information can also be found, for example, in Fasman, G.D., ed., Prediction of Protein 
Structure and the Principles of Protein Conformation, Plenum Press, New York, ISBN: 
0-306 4313-9. 
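
The 'secondary screen' arrangement described in this paragraph can be sketched as a two-stage filter in which the rapid scoring function is applied to every peptide and the slower, more accurate calculation is run only on the positive binders. The function `two_stage_screen` and its parameters are hypothetical names introduced for illustration; neither scorer corresponds to any of the named software packages.

```python
from typing import Callable, Dict, Iterable


def two_stage_screen(
    peptides: Iterable[str],
    fast_score: Callable[[str], float],
    slow_score: Callable[[str], float],
    threshold: float,
) -> Dict[str, float]:
    """Run the slow scorer only on peptides the fast scorer flags as positive binders."""
    # Primary screen: cheap score for every peptide.
    positives = [p for p in peptides if fast_score(p) >= threshold]
    # Secondary screen: expensive score for the (usually much smaller) positive set.
    return {p: slow_score(p) for p in positives}
```

The cost of the expensive calculation then scales with the number of positive binders rather than with the total number of peptides scanned.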

Equivalents 

[0147] The invention may be embodied in other specific forms without departing from 
the spirit or essential characteristics thereof. The foregoing embodiments are therefore to 
be considered in all respects illustrative rather than limiting on the invention described 
herein. Scope of the invention is thus indicated by the appended claims rather than by the 
foregoing description, and all changes which come within the meaning and range of 
equivalency of the claims are intended to be embraced therein. 

Incorporation by Reference 
[0148] All patents, patent applications, and scientific publications mentioned herein 
above are incorporated by reference into this application in their entirety. 

CLAIMS 

What is claimed is: 

1. A method for reducing the immunogenicity of a fusion protein, the method comprising: 
i. identifying a candidate T-cell epitope within a junction region spanning a fusion junction of a fusion protein; and, 
ii. changing an amino acid within the junction region to reduce the ability of the candidate T-cell epitope to interact with a T cell receptor. 

2. A fusion protein produced by the method of claim 1. 

3. A method for reducing the immunogenicity of a fusion protein, the method comprising changing a candidate T-cell epitope within a junction region spanning a fusion junction of a fusion protein to reduce the ability of the candidate T-cell epitope to interact with a T cell receptor 
i. T-cell epitope 

4. A fusion protein produced by the method of claim 3. 

5. A method for reducing the immunogenicity of a fusion protein, the method comprising introducing a glycosylation site within the junction region spanning the fusion junction. 

6. A method for reducing the immunogenicity of a fusion protein, the method comprising introducing a glycosylation site within 10 amino acids of the fusion junction. 

7. A method of claim 5, the method comprising introducing a glycosylation site within 5 amino acids of the fusion junction. 

8. A method of claim the method comprising introducing a glycosylation site within 2 amino acids of the fusion junction. 

9. A method for reducing the immunogenicity of a fusion protein, the method comprising the steps of: 
i. providing a fusion protein with a junction region comprising [...] amino acid; and 
ii. assaying said fusion protein in an immunogenicity assay. 

10. A fusion protein produced by the method of claim 5, 6, 7, 8, or 9. 

11. A method of claim 5-8, wherein the glycosylation is an N-linked glycosylation. 

12. A method of claim 5-8, wherein the glycosylation is an O-linked glycosylation. 

13. A fusion protein of claim 2, 4, or 9, wherein the protein comprises an Ig region. 

14. A fusion protein of claim 2, 4, or 9, wherein the protein comprises a serum albumin region. 

15. A fusion protein of claim 2, 4, or 9, wherein the protein comprises a cytokine activity. 

16. A fusion protein of claim 2, 4, or 9, wherein the protein comprises a hormone activity. 

17. A fusion protein of claim 13, wherein the Ig region, the Ig region comprises sequences of more than one antibody isotype. 

18. A fusion protein with reduced immunogenicity comprising 
a first protein; and, 
a second protein linked to said first protein via a fusion junction, 
wherein the amino acid sequence of a junction region surrounding the fusion junction is modified to remove a non-self T-cell epitope. 

19. The fusion protein of claim 18, wherein the junction region comprises between 1 and 25 amino acids. 

20. The fusion protein of claim 18, wherein the junction region comprises between 1 and 15 amino acids. 

21. The fusion protein of claim 18, wherein the junction region comprises between 1 and 9 amino acids. 

22. The fusion protein of claim 18, wherein the junction region comprises an N-linked or an O-linked glycosylation site. 

23. The fusion protein of claim 18, wherein the junction region comprises a spacer or linker. 

24. The fusion protein of claim 18, wherein the junction region comprises an Asn-X-Ser/Thr-Gly-amino acid sequence, wherein X is any amino acid. 

25. The fusion protein of claim 18, wherein the first protein comprises an Ig molecule or a fragment thereof. 

26. The fusion protein of claim 25, wherein the C-terminus of said Ig molecule or fragment thereof is linked to the N-terminus of said second protein. 

27. The fusion protein of claim 18, wherein the junction region comprises an IgG sequence having an ATAT amino acid sequence instead of an LSLS amino acid sequence. 

28. The fusion protein of claim 25, wherein the Ig molecule or fragment thereof comprises an Fc molecule. 

29. The fusion protein of claim 25, wherein the Ig molecule or fragment thereof comprises amino acid sequences from two antibody isotypes. 

30. The fusion protein of claim 29, wherein the Ig molecule or fragment thereof comprises IgG1 and IgG2 amino acid sequences. 

31. The fusion protein of claim 18, wherein the second protein has cytokine activity. 

32. The fusion protein of claim 18, wherein the second protein has hormone activity. 

33. The fusion protein of claim 18, wherein the first protein is an albumin protein. 

34. A method of reducing the immunogenicity of a fusion protein, the method comprising: 
i. identifying an amino acid in a peptide in a junction region, wherein the amino acid is selected from the group consisting of a leucine, a valine, an isoleucine, a methionine, a phenylalanine, a tryptophan and a tyrosine; and 
ii. changing the amino acid in the peptide, such that the ability of the peptide to bind to MHC Class II is reduced. 

35. A fusion protein produced by the method of claim 34. 

36. A nucleic acid encoding a fusion protein with reduced immunogenicity, the fusion protein comprising: 
i. a first protein; 
ii. a second protein linked to the first protein via a fusion junction, wherein the amino acid sequence of a junction region spanning the fusion junction is modified to remove a non-self T-cell epitope. 
iii. 

37. A nucleic acid of claim 36, wherein the junction region comprises a linker. 

SEQUENCE LISTING 

<110> Lexigen Pharmaceuticals Corp. 

<120> Reducing the Immunogenicity of Fusion Proteins 

<130> LEX-017PC 

<150> US 60/280,625 
<151> 2001-03-30 

<160> 60 

<170> PatentIn version 3.0 

<210> 1 

<211> 330 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> human Ig gamma heavy chain C region 
<400> 1 

Ala Ser Thr Lys Gly Pro Ser Val Phe Pro Leu Ala Pro Ser Ser Lys 
1 5 10 15 

Ser Thr Ser Gly Gly Thr Ala Ala Leu Gly Cys Leu Val Lys Asp Tyr 
20 25 30 

Phe Pro Glu Pro Val Thr Val Ser Trp Asn Ser Gly Ala Leu Thr Ser 
35 40 45 

Gly Val His Thr Phe Pro Ala Val Leu Gln Ser Ser Gly Leu Tyr Ser 
50 55 60 

Leu Ser Ser Val Val Thr Val Pro Ser Ser Ser Leu Gly Thr Gln Thr 
65 70 75 80 

Tyr Ile Cys Asn Val Asn His Lys Pro Ser Asn Thr Lys Val Asp Lys 
85 90 95 

Lys Val Glu Pro Lys Ser Cys Asp Lys Thr His Thr Cys Pro Pro Cys 
100 105 110 

Pro Ala Pro Glu Leu Leu Gly Gly Pro Ser Val Phe Leu Phe Pro Pro 
115 120 125 

Lys Pro Lys Asp Thr Leu Met Ile Ser Arg Thr Pro Glu Val Thr Cys 
130 135 140 

Val Val Val Asp Val Ser His Glu Asp Pro Glu Val Lys Phe Asn Trp 
145 150 155 160 

Tyr Val Asp Gly Val Glu Val His Asn Ala Lys Thr Lys Pro Arg Glu 
165 170 175 

Glu Gln Tyr Asn Ser Thr Tyr Arg Val Val Ser Val Leu Thr Val Leu 
180 185 190 

His Gln Asp Trp Leu Asn Gly Lys Glu Tyr Lys Cys Lys Val Ser Asn 
195 200 205 

Lys Ala Leu Pro Ala Pro Ile Glu Lys Thr Ile Ser Lys Ala Lys Gly 
210 215 220 

Gln Pro Arg Glu Pro Gln Val Tyr Thr Leu Pro Pro Ser Arg Asp Glu 
225 230 235 240 

Leu Thr Lys Asn Gln Val Ser Leu Thr Cys Leu Val Lys Gly Phe Tyr 
245 250 255 

Pro Ser Asp Ile Ala Val Glu Trp Glu Ser Asn Gly Gln Pro Glu Asn 
260 265 270 

Asn Tyr Lys Thr Thr Pro Pro Val Leu Asp Ser Asp Gly Ser Phe Phe 
275 280 285 

Leu Tyr Ser Lys Leu Thr Val Asp Lys Ser Arg Trp Gln Gln Gly Asn 
290 295 300 

Val Phe Ser Cys Ser Val Met His Glu Ala Leu His Asn His Tyr Thr 
305 310 315 320 

Gln Lys Ser Leu Ser Leu Ser Pro Gly Lys 
325 330 

<210> 2 

<211> 326 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> human Ig gamma-2 chain C region 

<400> 2 

Ala Ser Thr Lys Gly Pro Ser Val Phe Pro Leu Ala Pro Cys Ser Arg 
1 5 10 15 

Ser Thr Ser Glu Ser Thr Ala Ala Leu Gly Cys Leu Val Lys Asp Tyr 
20 25 30 

Phe Pro Glu Pro Val Thr Val Ser Trp Asn Ser Gly Ala Leu Thr Ser 
35 40 45 

Gly Val His Thr Phe Pro Ala Val Leu Gln Ser Ser Gly Leu Tyr Ser 
50 55 60 

Leu Ser Ser Val Val Thr Val Pro Ser Ser Asn Phe Gly Thr Gln Thr 
65 70 75 80 

Tyr Thr Cys Asn Val Asp His Lys Pro Ser Asn Thr Lys Val Asp Lys 
85 90 95 

Thr Val Glu Arg Lys Cys Cys Val Glu Cys Pro Pro Cys Pro Ala Pro 
100 105 110 

Pro Val Ala Gly Pro Ser Val Phe Leu Phe Pro Pro Lys Pro Lys Asp 
115 120 125 

Thr Leu Met Ile Ser Arg Thr Pro Glu Val Thr Cys Val Val Val Asp 
130 135 140 

Val Ser His Glu Asp Pro Glu Val Gln Phe Asn Trp Tyr Val Asp Gly 
145 150 155 160 

Val Glu Val His Asn Ala Lys Thr Lys Pro Arg Glu Glu Gln Phe Asn 
165 170 175 

Ser Thr Phe Arg Val Val Ser Val Leu Thr Val Val His Gln Asp Trp 
180 185 190 

Leu Asn Gly Lys Glu Tyr Lys Cys Lys Val Ser Asn Lys Gly Leu Pro 
195 200 205 

Ala Pro Ile Glu Lys Thr Ile Ser Lys Thr Lys Gly Gln Pro Arg Glu 
210 215 220 

Pro Gln Val Tyr Thr Leu Pro Pro Ser Arg Glu Glu Met Thr Lys Asn 
225 230 235 240 

Gln Val Ser Leu Thr Cys Leu Val Lys Gly Phe Tyr Pro Ser Asp Ile 
245 250 255 

Ala Val Glu Trp Glu Ser Asn Gly Gln Pro Glu Asn Asn Tyr Lys Thr 
260 265 270 

Thr Pro Pro Met Leu Asp Ser Asp Gly Ser Phe Phe Leu Tyr Ser Lys 
275 280 285 

Leu Thr Val Asp Lys Ser Arg Trp Gln Gln Gly Asn Val Phe Ser Cys 
290 295 300 

Ser Val Met His Glu Ala Leu His Asn His Tyr Thr Gln Lys Ser Leu 
305 310 315 320 

Ser Leu Ser Pro Gly Lys 
325 

<210> 3 

<211> 362 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> human Ig3 constant region 

<400> 3 

Ala Ser Thr Lys Gly Pro Ser Val Phe Pro Leu Ala Pro Cys Ser Arg 
1 5 10 15 

Ser Thr Ser Gly Gly Thr Ala Ala Leu Gly Cys Leu Val Lys Asp Tyr 
20 25 30 

Phe Pro Glu Pro Val Thr Val Ser Trp Asn Ser Gly Ala Leu Thr Ser 
35 40 45 

Gly Val His Thr Phe Pro Ala Val Leu Gln Ser Ser Gly Leu Tyr Ser 
50 55 60 

Leu Ser Ser Val Val Thr Val Pro Ser Ser Ser Leu Gly Thr Gln Thr 
65 70 75 80 

Tyr Thr Cys Asn Val Asn His Lys Pro Ser Asn Thr Lys Val Asp Lys 
85 90 95 

Arg Val Glu Leu Lys Thr Pro Leu Gly Asp Thr Thr His Thr Cys Pro 
100 105 110 

Arg Cys Pro Glu Pro Lys Ser Cys Asp Thr Pro Pro Pro Cys Pro Arg 
115 120 125 

Cys Pro Glu Pro Lys Ser Cys Asp Thr Pro Pro Pro Cys Pro Arg Cys 
130 135 140 

Pro Ala Pro Glu Leu Leu Gly Gly Pro Ser Val Phe Leu Phe Pro Pro 
145 150 155 160 

Lys Pro Lys Asp Thr Leu Met Ile Ser Arg Thr Pro Glu Val Thr Cys 
165 170 175 

Val Val Val Asp Val Ser His Glu Asp Pro Glu Val Gln Phe Lys Trp 
180 185 190 

Tyr Val Asp Gly Val Glu Val His Asn Ala Lys Thr Lys Pro Trp Glu 
195 200 205 

Glu Gln Tyr Asn Ser Thr Phe Arg Val Val Ser Val Leu Thr Val Leu 
210 215 220 

His Gln Asp Trp Leu Asn Gly Lys Glu Tyr Lys Cys Lys Val Ser Asn 
225 230 235 240 

Lys Ala Leu Pro Ala Pro Ile Glu Lys Thr Ile Ser Lys Thr Lys Gly 
245 250 255 

Gln Pro Arg Glu Pro Gln Val Tyr Thr Leu Pro Pro Ser Arg Glu Glu 
260 265 270 

Met Thr Lys Asn Gln Val Ser Leu Thr Cys Leu Val Lys Gly Phe Tyr 
275 280 285 

Pro Ser Asp Ile Ala Met Glu Trp Glu Ser Ser Gly Gln Pro Glu Asn 
290 295 300 

Asn Tyr Lys Thr Thr Pro Pro Val Leu Asp Ser Asp Gly Ser Phe Phe 
305 310 315 320 

Leu Tyr Ser Lys Leu Thr Val Asp Lys Ser Arg Trp Gln Gln Gly Asn 
325 330 335 

Ile Phe Ser Cys Ser Val Met His Glu Ala Leu His Asn His Tyr Thr 
340 345 350 

Gln Lys Ser Leu Ser Leu Ser Pro Gly Lys 
355 360 

<210> 4 

<211> 327 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Ig gamma-4 chain C region 
<400> 4 

Ala Ser Thr Lys Gly Pro Ser Val Phe Pro Leu Ala Pro Cys Ser Arg 
1 5 10 15 

Ser Thr Ser Glu Ser Thr Ala Ala Leu Gly Cys Leu Val Lys Asp Tyr 
20 25 30 

Phe Pro Glu Pro Val Thr Val Ser Trp Asn Ser Gly Ala Leu Thr Ser 
35 40 45 

Gly Val His Thr Phe Pro Ala Val Leu Gln Ser Ser Gly Leu Tyr Ser 
50 55 60 

Leu Ser Ser Val Val Thr Val Pro Ser Ser Ser Leu Gly Thr Lys Thr 
65 70 75 80 

Tyr Thr Cys Asn Val Asp His Lys Pro Ser Asn Thr Lys Val Asp Lys 
85 90 95 

Arg Val Glu Ser Lys Tyr Gly Pro Pro Cys Pro Ser Cys Pro Ala Pro 
100 105 110 

Glu Phe Leu Gly Gly Pro Ser Val Phe Leu Phe Pro Pro Lys Pro Lys 
115 120 125 

Asp Thr Leu Met Ile Ser Arg Thr Pro Glu Val Thr Cys Val Val Val 
130 135 140 

Asp Val Ser Gln Glu Asp Pro Glu Val Gln Phe Asn Trp Tyr Val Asp 
145 150 155 160 

Gly Val Glu Val His Asn Ala Lys Thr Lys Pro Arg Glu Glu Gln Phe 
165 170 175 

Asn Ser Thr Tyr Arg Val Val Ser Val Leu Thr Val Leu His Gln Asp 
180 185 190 

Trp Leu Asn Gly Lys Glu Tyr Lys Cys Lys Val Ser Asn Lys Gly Leu 
195 200 205 

Pro Ser Ser Ile Glu Lys Thr Ile Ser Lys Ala Lys Gly Gln Pro Arg 
210 215 220 

Glu Pro Gln Val Tyr Thr Leu Pro Pro Ser Gln Glu Glu Met Thr Lys 
225 230 235 240 

Asn Gln Val Ser Leu Thr Cys Leu Val Lys Gly Phe Tyr Pro Ser Asp 
245 250 255 

Ile Ala Val Glu Trp Glu Ser Asn Gly Gln Pro Glu Asn Asn Tyr Lys 
260 265 270 

Thr Thr Pro Pro Val Leu Asp Ser Asp Gly Ser Phe Phe Leu Tyr Ser 
275 280 285 

Arg Leu Thr Val Asp Lys Ser Arg Trp Gln Glu Gly Asn Val Phe Ser 
290 295 300 

Cys Ser Val Met His Glu Ala Leu His Asn His Tyr Thr Gln Lys Ser 
305 310 315 320 

Leu Ser Leu Ser Leu Gly Lys 
325 

<210> 5 

<211> 9 

<212> PRT 

<213> Artificial 

<220> 

<223> potential T cell epitope 
<400> 5 

Lys Ser Leu Ser Leu Ser Pro Gly Lys 
1 5 

<210> 6 

<211> 9 

<212> PRT 

<213> Artificial 

<220> 

<223> mutated potential T cell epitope 
<400> 6 

Lys Ser Ala Thr Ala Thr Pro Gly Lys 
1 5 

<210> 7 

<211> 9 

<212> PRT 

<213> Artificial 

<220> 

<223> mutated potential T cell epitope 
<400> 7 

Lys Ser Ala Thr Ala Thr Pro Gly Ala 
1 5 

<210> 8 

<211> 13 

<212> PRT 

<213> Artificial 

<220> 

<223> sequence near the C-terminus of CH3 domain of the IgA Fc region 
<400> 8 

Gln Lys Thr Ile Asp Arg Leu Ala Gly Lys Pro Thr His 
1 5 10 

<210> 9 

<211> 13 

<212> PRT 

<213> Artificial 

<220> 

<223> mutated sequence near the C-terminus of CH3 domain of the IgA Fc 
region 

<400> 9 

Gln Lys Thr Ala Asp Arg Thr Ala Gly Lys Pro Thr His 
1 5 10 

<210> 10 

<211> 13 

<212> PRT 

<213> Artificial 

<220> 

<223> deimmunized sequence in an IgA-X fusion 
<400> 10 

Gln Lys Thr Pro Thr Arg Thr Ala Gly Lys Pro Thr His 
1 5 10 

<210> 11 

<211> 13 

<212> PRT 

<213> Artificial 

<220> 

<223> deimmunized sequence in an IgA-X fusion 
<400> 11 

Gln Lys Thr Pro Thr Arg Pro Ala Gly Lys Pro Thr His 
1 5 10 

<210> 12 

<211> 13 

<212> PRT 

<213> Artificial 

<220> 

<223> deimmunized sequence in an IgA-X fusion 
<400> 12 

Gln Lys Thr Ala Thr Arg Thr Ala Gly Lys Pro Thr His 
1 5 10 

<210> 13 

<211> 13 

<212> PRT 

<213> Artificial 

<220> 

<223> potential T cell epitope in the HSA-IFNalpha junction 
<400> 13 

Lys Lys Leu Val Ala Ala Ser Gln Ala Ala Leu Gly Leu 
1 5 10 

<210> 14 

<211> 13 

<212> PRT 

<213> Artificial 

<220> 

<223> sequence in the C-terminus of albumin 
<400> 14 

Lys Lys Leu Val Ala Ala Ser Gln Ala Ala Leu Gly Leu 
1 5 10 

<210> 15 

<211> 13 

<212> PRT 

<213> Artificial 

<220> 

<223> modified C-terminus of albumin 
<400> 15 

Lys Lys Leu Val Ala Ala Ser Gln Ala Ala Thr Thr Ala 
1 5 10 

<210> 16 

<211> 30 

<212> PRT 

<213> Artificial 

<220> 

<223> sequence in an Fc fusion protein 
<400> 16 

His Asn His Tyr Thr Gln Lys Ser Leu Ser Leu Ser Pro Gly Lys Gly 
1 5 10 15 

Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser 
20 25 30 

<210> 17 
<211> 30 
<212> PRT 
<213> Artificial 

<220> 

<223> modified sequence in an Fc fusion protein 
<400> 17 

His Asn His Tyr Thr Gln Lys Ser Ala Thr Ala Thr Pro Gly Lys Gly 
1 5 10 15 

Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser 
20 25 30 

<210> 18 
<211> 9 
<212> PRT 
<213> Artificial 

<220> 

<223> junction sequence 
<400> 18 

Leu Ser Leu Ser Pro Gly Lys Ala Pro 
1 5 

<210> 19 
<211> 9 
<212> PRT 

<213> Artificial 

<220> 

<223> modified junction sequence 
<400> 19 

Ala Thr Ala Thr Pro Gly Ala Ala Pro 
1 5 

<210> 20 

<211> 9 

<212> PRT 

<213> Artificial 

<220> 

<223> modified junction sequence 
<400> 20 

Leu Asn Leu Ser Pro Gly Ala Ala Pro 
1 5 

<210> 21 

<211> 13 

<212> PRT 

<213> Artificial 

<220> 

<223> synthetic peptide containing a reactive epitope 
<400> 21 

Lys Ser Leu Ser Leu Ser Pro Gly Lys Ala Pro Thr Ser 
1 5 10 

<210> 22 

<211> 13 

<212> PRT 

<213> Artificial 

<220> 

<223> modified synthetic peptide containing a reactive epitope 
<400> 22 

Lys Ser Ala Thr Ala Thr Pro Gly Lys Ala Pro Thr Ser 
1 5 10 

<210> 23 

<211> 13 

<212> PRT 

<213> Artificial 

<220> 

<223> potential T cell epitope in HSA-IFNalpha fusion 
<400> 23 

Lys Leu Val Ala Ala Ser Gln Ala Ala Leu Gly Leu Cys 
1 5 10 

<210> 24 

<211> 13 

<212> PRT 

<213> Artificial 

<220> 

<223> potential T cell epitope in HSA-IFNalpha fusion 
<400> 24 

Leu Gly Leu Cys Asp Leu Pro Gln Thr His Ser Leu Gly 
1 5 10 

<210> 25 

<211> 5 

<212> PRT 

<213> Artificial 

<220> 

<223> C-terminus albumin sequence 
<400> 25 

Ala Ala Leu Gly Leu 
1 5 

<210> 26 

<211> 5 

<212> PRT 

<213> Artificial 

<220> 

<223> mutated C-terminus albumin sequence 
<400> 26 

Thr Ala Thr Thr Ala 

<210> 27 

<211> 19 

<212> PRT 

<213> Artificial 

<220> 

<223> modified albumin junction region 
<400> 27 

Cys Phe Ala Glu Glu Gly Lys Lys Leu Val Ala Ala Ser Gln Thr Ala 
1 5 10 15 

Thr Thr Ala 



<210> 28 

<211> 21 

<212> PRT 

<213> Artificial 

<220> 

<223> non-mutant 
<400> 28 

Glu Pro Lys Ser Cys Asp Lys Thr His Thr Cys Pro Pro Cys Pro Ala 
1 5 10 15 

Pro Glu Leu Leu Gly 
20 

<210> 29 
<211> 52 
<212> PRT 
<213> Artificial 

<220> 

<223> GLP-1-mutant Fc fusion junction 
<400> 29 

His Ala Glu Gly Thr Phe Thr Ser Asp Val Ser Ser Tyr Leu Glu Gly 
1 5 10 15 

Gln Ala Ala Lys Glu Phe Ile Ala Trp Leu Val Lys Gly Arg Gly Glu 
20 25 30 

Pro Lys Ser Ser Asp Lys Thr His Thr Cys Pro Pro Cys Pro Ala Pro 
35 40 45 

Glu Leu Leu Gly 
50 

<210> 30 
<211> 41 
<212> PRT 
<213> Artificial 

<220> 

<223> GLP-1-normal Fc fusion junction 
<400> 30 

Ser Tyr Leu Glu Gly Gln Ala Ala Lys Glu Phe Ile Ala Trp Leu Val 
1 5 10 15 

Lys Gly Arg Gly Glu Pro Lys Ser Cys Asp Lys Thr His Thr Cys Pro 
20 25 30 

Pro Cys Pro Ala Pro Glu Leu Leu Gly 
35 40 

<210> 31 

<211> 13 

<212> PRT 

<213> Artificial 

<220> 

<223> potential T cell epitope at the GLP-1-Fc fusion junction 
<400> 31 

Glu Phe Ile Ala Trp Leu Val Lys Gly Arg Gly Glu Pro 
1 5 10 

<210> 32 

<211> 13 

<212> PRT 

<213> Artificial 

<220> 

<223> potential T cell epitope in GLP-1-Fc fusion junction 
<400> 32 

Ala Trp Leu Val Lys Gly Arg Gly Glu Pro Lys Ser Ser 
1 5 10 

<210> 33 

<211> 52 

<212> PRT 

<213> Artificial 

<220> 

<223> deimmunized GLP-1-Fc fusion junction 
<400> 33 

His Ala Glu Gly Thr Phe Thr Ser Asp Val Ser Ser Tyr Leu Glu Gly 
1 5 10 15 

Gln Ala Ala Lys Glu Phe Ala Ala Trp Ala Val Thr Gly Thr Gly Glu 
20 25 30 

Pro Lys Ser Ser Asp Lys Thr His Thr Cys Pro Pro Cys Pro Ala Pro 
35 40 45 

Glu Leu Leu Gly 
50 

<210> 34 
<211> 41 
<212> PRT 
<213> Artificial 

<220> 

<223> GLP-1-Fc fusion junction with a glycosylation site 
<400> 34 

Ser Tyr Leu Glu Gly Gln Ala Ala Lys Glu Phe Ile Ala Trp Leu Val 
1 5 10 15 

Lys Gly Arg Asn Gly Ser Lys Ser Ser Asp Lys Thr His Thr Cys Pro 
20 25 30 

Pro Cys Pro Ala Pro Glu Leu Leu Gly 
35 40 

<210> 35 

<211> 41 

<212> PRT 

<213> Artificial 

<220> 

<223> TNF-R-gamma-1 fusion junction 
<400> 35 

Ser Thr Ser Phe Leu Leu Pro Met Gly Pro Ser Pro Pro Ala Glu Gly 
1 5 10 15 

Ser Thr Gly Asp Glu Pro Lys Ser Cys Asp Lys Thr His Thr Cys Pro 
20 25 30 

35 40 


<210> 36 

<211> 41 

<212> PRT 

<213> Artificial 

<220> 

<223> TNF-R-Fc fusion junction 
<400> 36 

Ser Thr Ser Phe Leu Leu Pro Met Gly Pro Ser Pro Pro Ala Glu Gly 
1 5 10 15 

Ser Thr Gly Asn Gly Ser Lys Ser Cys Asp Lys Thr His Thr Cys Pro 
20 25 30 

Pro Cys Pro Ala Pro Glu Leu Leu Gly 
35 40 

<210> 37 

<211> 40 

<212> PRT 

<213> Artificial 

<220> 

<223> modified Fc-IL12p35 fusion junction 
<400> 37 

Met His Glu Ala Leu His Asn His Tyr Thr Gln Lys Ser Ala Thr Ala 
1 5 10 15 

Thr Pro Gly Lys Arg Asn Leu Pro Val Ala Thr Pro Asp Pro Gly Met 
20 25 30 

Phe Pro Cys Leu His His Ser Gln 
35 40 

<210> 38 

<211> 40 

<212> PRT 

<213> Artificial 



<220> 

<223> IL-12p40-IL2 fusion junction 
<400> 38 

Arg Ala Gln Asp Arg Tyr Tyr Ser Ser Ser Trp Ser Glu Trp Ala Ser 
1 5 10 15 

Val Pro Cys Ser Ala Pro Thr Ser Ser Ser Thr Lys Lys Thr Gln Leu 
20 25 30 

Gln Leu Glu His Leu Leu Leu Asp 
35 40 

<210> 39 
<211> 38 
<212> PRT 

<213> Artificial 

<220> 

<223> albumin-CD4 fusion junction 
<400> 39 

Thr Cys Phe Ala Glu Glu Gly Lys Lys Leu Val Ala Ala Ser Gln Ala 
1 5 10 15 

Ala Leu Gly Leu Lys Lys Val Leu Gly Lys Lys Gly Asp Thr Val Glu 
20 25 30 

Leu Thr Cys Thr Ala Ser 
35 

<210> 40 
<211> 40 
<212> PRT 
<213> Artificial 

<220> 

<223> modified IL12p40-IL2 fusion junction 
<400> 40 

Arg Ala Gln Asp Arg Tyr Tyr Ser Ser Ser Trp Ser Glu Trp Ala Ser 
1 5 10 15 

Val Pro Cys Ser Asn Gly Thr Ser Ser Ser Thr Lys Lys Thr Gln Leu 
20 25 30 

Gln Leu Glu His Leu Leu Leu Asp 
35 40 

<210> 41 

<211> 13 

<212> PRT 

<213> Artificial 

<220> 

<223> potential T cell epitope at the GLP-1-Fc fusion junction 

<400> 41 

Lys Glu Phe Ile Ala Trp Leu Val Lys Gly Arg Gly Glu 
1 5 10 

<210> 42 

<211> 13 

<212> PRT 

<213> Artificial 

<220> 

<223> potential T cell epitope in HSA-IFNalpha junction 
<400> 42 

Lys Leu Val Ala Ala Ser Gln Ala Ala Leu Gly Leu Cys 
1 5 10 

<210> 43 
<211> 13 
<212> PRT 
<213> Artificial 

<220> 

<223> potential T cell epitope in IL12p40-IL2 fusion junction 
<400> 43 

Ala Ser Val Pro Cys Ser Asn Gly Thr Ser Ser Ser Thr 
1 5 10 

<210> 44 
<211> 41 
<212> PRT 
<213> Artificial 

<220> 

<223> IL4-Fc fusion junction 
<400> 44 

Glu Asn Phe Leu Glu Arg Leu Lys Thr Ile Met Arg Glu Lys Tyr Ser 
1 5 10 15 

Lys Cys Ser Ser Glu Pro Lys Ser Cys Asp Lys Thr His Thr Cys Pro 
20 25 30 

35 40 


<210> 45 

<211> 40 

<212> PRT 

<213> Artificial 

<220> 

<223> Fc-GMCSF fusion junction 
<400> 45 

Met His Glu Ala Leu His Asn His Tyr Thr Gln Lys Ser Leu Ser Leu 
1 5 10 15 

Ser Pro Gly Lys Pro Ala Arg Ser Pro Ser Pro Ser Thr Gln Pro Trp 
20 25 30 

Glu His Val Asn Ala Ile Gln Glu 
35 40 

<210> 46 

<211> 13 

<212> PRT 

<213> Artificial 

<220> 

<223> potential T-cell epitope at IL4-Fc fusion junction 
<400> 46 

Glu Lys Tyr Ser Lys Cys Ser Ser Glu Pro Lys Ser Cys 
1 5 10 

<210> 47 
<211> 41 
<212> PRT 
<213> Artificial 
<220> 

<223> modified IL4-Fc fusion 
<400> 47 

Glu Asn Phe Leu Glu Arg Leu Lys Thr Ile Met Arg Glu Lys Tyr Ser 
1 5 10 15 

Lys Cys Ser Ser Thr Pro Lys Ser Cys Asp Lys Thr His Thr Cys Pro 
20 25 30 

Pro Cys Pro Ala Pro Glu Leu Leu Gly 
35 40 

<210> 48 

<211> 40 

<212> PRT 

<213> Artificial 

<220> 

<223> deimmunized Fc-GMCSF fusion junction 

<400> 48 

Met His Glu Ala Leu His Asn His Tyr Thr Gln Lys Ser Ala Thr Ala 
1 5 10 15 

Thr Pro Gly Lys Pro Ala Arg Ser Pro Ser Pro Ser Thr Gln Pro Trp 
20 25 30 

Glu His Val Asn Ala Ile Gln Glu 
35 40 

<210> 49 

<211> 35 

<212> PRT 

<213> Artificial 

<220> 

<223> IgG2CH1-IgG1hinge fusion junction 
<400> 49 

Gln Thr Tyr Thr Cys Asn Val Asp His Lys Pro Ser Asn Thr Lys Val 
1 5 10 15 

Asp Lys Thr Val Glu Pro Lys Ser Cys Asp Lys Thr His Thr Cys Pro 
20 25 30 

Pro Cys Pro 
35 

<210> 50 

<211> 13 

<212> PRT 

<213> Artificial 

<220> 

<223> potential T cell epitope in the IgG2CH1-IgG1 hinge fusion junction 
<400> 50 

Thr Lys Val Asp Lys Thr Val Glu Pro Lys Ser Cys Asp 
1 5 10 

<210> 51 

<211> 13 

<212> PRT 

<213> Artificial 

<220> 

<223> potential T cell epitope in the IgG2CH1-IgG1 hinge fusion junction 

<400> 51 

Lys Thr Val Glu Pro Lys Ser Cys Asp Lys Thr His Thr 
1 5 10 

<210> 52 

<211> 35 

<212> PRT 

<213> Artificial 

<220> 

<223> IgG1hinge-IgG2CH2 fusion junction 
<400> 52 

Glu Pro Lys Ser Cys Asp Lys Thr His Thr Cys Pro Pro Cys Pro Ala 
1 5 10 15 

Pro Pro Val Ala Gly Pro Ser Val Phe Leu Phe Pro Pro Lys Pro Lys 
20 25 30 

Asp Thr Leu 
35 

<210> 53 

<211> 35 

<212> PRT 

<213> Artificial 

<220> 

<223> modified IgG2CH1-IgG1hinge fusion junction 
<400> 53 

Gln Thr Tyr Thr Cys Asn Val Asp His Lys Pro Ser Asn Thr Lys Ala 
1 5 10 15 

Asp Lys Thr Ala Glu Pro Lys Ser Cys Asp Lys Thr His Thr Cys Pro 
20 25 30 

Pro Cys Pro 
35 

<210> 54 
<211> 35 
<212> PRT 
<213> Artificial 

<220> 

<223> modified IgG2CH1-IgG1hinge fusion junction 
<400> 54 

Gln Thr Tyr Thr Cys Asn Val Asp His Lys Pro Ser Asn Thr Lys Val 
1 5 10 15 

Asp Lys Thr Val Glu Pro Lys Ser Ser Asp Lys Thr His Thr Cys Pro 
20 25 30 

Pro Cys Pro 
35 

<210> 55 
<211> 35 
<212> PRT 
<213> Artificial 

<220> 

<223> modified IgG1hinge-IgG2CH2 fusion junction 
<400> 55 

Glu Pro Lys Ser Ser Asp Lys Thr His Thr Cys Pro Pro Cys Pro Ala 
1 5 10 15 

Pro Pro Val Ala Gly Pro Ser Val Phe Leu Phe Pro Pro Lys Pro Lys 
20 25 30 

Asp Thr Leu 
35 

<210> 56 
<211> 166 
<212> PRT 
<213> Artificial 

<220> 

<223> mutant EPO sequence 
<400> 56 

Ala Pro Pro Arg Leu Ile Cys Asp Ser Arg Val Leu Glu Arg Tyr Leu 
1 5 10 15 

Leu Glu Ala Lys Glu Ala Glu Asn Ile Thr Thr Gly Cys Ala Glu Gly 
20 25 30 

Pro Ser Leu Asn Glu Asn Ile Thr Val Pro Asp Thr Lys Val Asn Phe 
35 40 45 

Tyr Ala Trp Lys Arg Met Glu Val Gly Gln Gln Ala Val Glu Val Trp 
50 55 60 

Gln Gly Leu Ala Leu Leu Ser Glu Ala Val Leu Arg Gly Gln Ala Leu 
65 70 75 80 

Leu Val Asn Ser Ser Gln Pro Cys Glu Gly Leu Gln Leu His Val Asp 
85 90 95 

Lys Ala Val Ser Gly Leu Arg Ser Leu Thr Thr Leu Leu Arg Ala Leu 
100 105 110 

Gly Ala Gln Lys Glu Ala Ile Ser Pro Pro Asp Ala Ala Ser Ala Ala 
115 120 125 

Pro Leu Arg Thr Ile Thr Ala Asp Thr Phe Arg Lys Leu Phe Arg Val 
130 135 140 

Tyr Ser Asn Phe Leu Arg Gly Lys Leu Lys Leu Tyr Thr Gly Glu Ala 
145 150 155 160 

Cys Arg Thr Gly Asp Arg 
165 

<210> 57 

<211> 17 

<212> PRT 

<213> Artificial 

<220> 

<223> CH3-EPO fusion junction 
<400> 57 

Thr Gln Lys Ser Ala Thr Ala Thr Pro Gly Ala Ala Pro Pro Arg Leu 
1 5 10 15 

Ile 



<210> 58 

<211> 13 

<212> PRT 

<213> Artificial 

<220> 

<223> potential T cell epitope in HSA-IFNalpha junction 

<400> 58 

Leu Gly Leu Cys Asp Leu Pro Gln Thr His Ser Leu Gly 
1 5 10 

<210> 59 

<211> 8 

<212> PRT 

<213> Artificial 

<220> 

<223> IgG2 CH3 sequence 

<400> 59 

Lys Ser Leu Ser Leu Ser Pro Gly 
1 5 

<210> 60 

<211> 8 

<212> PRT 

<213> Artificial 

<220> 

<223> modified IgG2CH3 sequence 

<400> 60 

Lys Ser Ala Thr Ala Thr Pro Gly 
1 5 